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POLYPEPTIDB-POLYMER CONJUGATES HAVING ADDED AND/OR REMOVED ATTACHMENT GROUPS 

FIELD OF THE INVENTION 

The present invention relates to polypeptide-polymer 
5 conjugates having added and/or removed one or more attachment 
groups for coupling polymeric molecules on the surface of the 3D 
structure of the polypeptide, a method for preparing polypeptide- 
polymer conjugates of the invention, the use of said conjugated 
for reducing the immunogenicity and allergenic**. and 
10 compositions comprising said conjugate. 

BACKGROUND OF THE INVENTION 

The use of polypeptides, including enzymes, in the 
circulatory system to obtain a particular physiological effect is 
15 well-known in the medical arts. Further, within the arts of 
industrial applications, such as laundry washing, textile 
bleaching, person care, contact lens cleaning, food and feed 
preparation enzymes are used as a functional ingredient. One of 
the important differences between pharmaceutical and industrial 
20 application is that for the latter type of applications (i.e. 
industrial applications) the polypeptides (often enzymes) are not 
intended to enter into the circulatory system of the body. 

Certain polypeptides and enzymes have an unsatisfactory 
stability and may under certain circumstances - dependent on the 
25 way of challenge - cause an immune response, typically an IgG 
and/ or IgE response. 

It is today generally recognized that the stability of 
polypeptides is improved and the immune response is reduced when 
polypeptides, such as enzymes, are coupled to polymeric molecules. 
3 0 It is believed that the reduced immune response is a result of the 
shielding of (the) epito P e(s) on the surface of the polypeptide 
responsible for the immune response leading to antibody formation 
by the coupled polymeric molecules. 

Techniques for conjugating polymeric molecules to polypeptides 

35 are well-known in the art. 

One of the first suitable commercially techniques was described 
back in the early 1970 -ies and disclosed in e.g. US patent no. 
4 179,337. Said patent concerns non- immunogenic polypeptides, such 
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as enzymes and peptide hormones coupled to polyethylene glycol 
(PEG) or polypropylene glycol (PPG). At least 15% of polypeptides' 
physiological activity is maintained. 

GB patent no. 1,183,257 (Crook et al.) describes chemistry for 
5 conjugation of enzymes to polysaccharides via a triazine ring. 

Further, techniques for maintaining of the enzymatic activity 
of enzyme-polymer conjugates are also known in the art. 

WO 93/15189 (Veronese et al.) concerns a method for maintaining 
the activity in polyethylene glycol-modif ied proteolytic enzymes 
10 by linking the proteolytic enzyme to a macromolecularized 
inhibitor. The conjugates are intended for medical applications. 

It has been found that the attachment of polymeric molecules to 
a polypeptide often has the effect of reducing the activity of the 
polypeptide by interfering with the interaction between the 
15 polypeptide and its substrate. EP 183 503 (Beecham Group PLC) 
discloses a development of the above concept by providing 
conjugates comprising pharmaceutical ly useful proteins linked to 
at least one water-soluble polymer by means of a reversible 
linking group. 

20 EP 471,125 (Kanebo) discloses skin care products comprising a 
parent protease (Bacillus protease with the trade name Esperase®) 
coupled to polysaccharides through a triazine ring to improve the 
thermal and preservation stability.' The coupling technique used is 
also described in the above mentioned GB patent no. 1,183,257 

25 (Crook et al.). 

JP 3 083908 describes a skin cosmetic material which 
contains a transglutaminase from guinea pig liver modified with 
one or more water-soluble substance such as PEG, starch, 
cellulose etc. The modification is performed by activating the 

30 polymeric molecules and coupling them to the enzyme. The 
composition is stated to be mild to the skin. 

However, it is not always possible to readily couple 
polymeric molecules to polypeptides and enzymes. Further, there is 
still a need for polypeptide-polymer conjugates with an even more 

35 reduced immunogenicity and/or allergenicity . 

SUMMARY OF THE INVENTION 

It is the object of the present invention to provide improved 
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polypeptide-polymer conjugates suitable for industrial and 

pharmaceutical applications. 

The term "improved polypeptide-polymer conjugates" means in the 

context of the present invention conjugates having a reduced 
5 immune response in humans and animals and/or a improved stability. 

As will be described further below the immune response is 

dependent on the way of challenge. 

The present inventors have found that polypeptides, such as 

enzymes, may be made less immunogenic and/ or allergenic by adding 
10 and/ or removing one or more attachment groups on the surface of 

the parent polypeptide to be coupled to polymeric molecules. 

When introducing pharmaceutical polypeptide directly into the 

circulatory system (i.e. bloodstream) the potential risk is an 

immunogenic response in the form of mainly IgG, IgA and/ or IgM 
15 antibodies. In contrast hereto, industrial polypeptides, such as 

enzymes used as a functional ingredient in e.g. detergents, are 

not intended to enter the circulatory system. The potential risk 

in connection with industrial polypeptides is inhalation causing 

an allergenic response in the form of mainly IgE antibody 
20 formation. 

Therefore, in connection with industrial polypeptides the 
potential risk is respiratory allergenicity caused by inhalation, 
intratracheal and intranasal presentation of polypeptides. 

The main potential risk of pharmaceutical polypeptides is 
25 immunogenicity caused by intradermally , intravenously or subcu- 
taneous ly presentation of the polypeptide. 

It is to be understood that reducing the "immunogenicity" 
and reducing the "respiratory allergenicity" are two very 
different problems based on different routes of exposure and on 
3 0 two very different immunological mechanisms: 

The term "immunogenicity" used in connection with the 
present invention may be referred to as allergic contact 
dermatitis in a clinical setting and is a cell mediated delayed 
immune response to chemicals that contact and penetrate the skin. 
35 This cell mediated reaction is also termed delayed contact - 
hypersensitivity (type IV reaction according to Gell and Combs 
classification of immune mechanisms in tissue damage) . 

The term "allergenicity" or "respiratory allergenicity" is an 
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immediate anaphylactic reaction (type I antibody-mediated reaction 
according to Cell and Combs) following inhalatxon of e.g. 
polypeptides. 

According to the present invention it is possible to provide 
5 polypeptides with a reduced immune response and/or improved 
stability, which has a substantially retained residual activity. 

The allergic and the immunogenic response are in one term, at 
least in the context of the present invention called the "immune 
response" . 

10 in the first aspect the invention relates to a polypeptide- 
polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 

15 comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 

20 of the polypeptide in comparison to the number of attachment 
groups available on the corresponding parent polypeptide. 

The term "parent polypeptide" refers to the polypeptide to be 
modified by coupling to polymeric molecules. The parent 
polypeptide may be a naturally-occurring (or wild-type) 

2 5 polypeptide or may be a variant thereof prepared by any suitable 
means. For instance, the parent polypeptide may be a variant of a 
naturally-occurring polypeptide which has been modified by 
substitution, deletion or truncation of one or more amino acid 
residues or by addition or insertion of one or more amino acid 

30 residues to the amino acid sequence of a naturally-occurring 
polypeptide. 

A "suitable attachment group" means in the context of the 
present invention any amino acid residue group on the surface of 
the polypeptide capable of coupling to the polymeric molecule in 
35 question. 

Preferred attachment groups are amino groups of Lysine 
residues and the N-terminal amino group. Polymeric molecules may 
also be coupled to the carboxylic acid groups (-COOH) of amino 
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acid residues in the polypeptide chain located on the surface. 
Carboxylic acid attachment groups may be the carboxylic acid group 
of Aspartate or Glutamate and the C-terminal COOH-group. 

A "functional site" means any amino acid residues and/or 
5 cofactors which are known to be essential for the performance of 
the polypeptide, such as catalytic activity, e.g. the catalytic 
triad residues, Histidine, Aspartate and Serine in Serine 
proteases, or e.g. the heme group and the distal and proximal 
Histidines in a peroxidase such as the Arthromyces ramosus 
10 peroxidase. 

In the second aspect the invention relates to a method for 
preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
15 3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a 

20 suitable attachment group, and/or 

ii) substituting or deleting one or more amino acid residues 
selected in step b) at or close to the functional site(s), 

d) coupling polymeric molecules to the mutated polypeptide. 

The invention also relates to the use of a conjugate of the 
25 invention and the method of the invention for reducing the 
immunogenicity of pharmaceuticals and reducing the allergenicity 
of industrial products. 

Finally the invention relates to compositions comprising a 
conjugate of the invention and further ingredients used in 
30 industrial products or pharmaceuticals. 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 shows the anti-lipase serum antibody levels after 5 
weekly immunizations with i) control ii) unmodified lipase 
35 variant, iii) lipase variant-SPEG. (X: log (serum dilution) ; Y 
Optical Density (490/620)). 



DETAILED DESCRIPTION OF THE INVENTION 
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It is the object of the present invention to provide improved 
polypeptide-polymer conjugates suitable for industrial and 
pharmaceutical applications. 

Even though polypeptides used for pharmaceutical applications 
5 and industrial application can be quite different the principle of 
the present invention may be tailored to the specific type of 
parent polypeptide (i.e. enzyme, hormone peptides etc.). 

The inventors of the present invention have provided improved 
polypeptide-polymer conjugates with a reduced immune response in 
10 comparison to conjugates prepared from the corresponding parent 
polypeptides . 

The present inventors have found that polypeptides, such as 
enzymes, may be made less immunogenic and/or less allergenic by 
adding one or more attachment groups on the surface of the parent 
15 polypeptide. In addition thereto the inventors have found that a 
higher percentage of maintained residual functional activity may 
be obtained by removing attachment groups at or close to the 
functional site(s). 

In the first aspect the invention relates to an improved 
20 polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 
comparison to the number of attachment groups available on the 

25 corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 
of the polypeptide in comparison to the number of attachment 

30 groups available on the corresponding parent polypeptide. 

Whether the attachment groups should be added and/ or removed 
depends on the specific parent polypeptide. 

a) Addition of Attachment groups 
35 There may be a need for further attachment groups _on the 
polypeptide if only few attachment groups are available on the 
surface of the parent polypeptide. The addition of one or more 
attachment groups by substituting or inserting one or more amino 
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acid residues on the surface of the parent polypeptide increases 
the number of polymeric molecules which may be attached m 
comparison to the corresponding parent polypeptide. Con 3 ugates 
with an increased number of polymeric molecules attached thereto 
5 are generally seen to have a reduced immune response in comparison 
to the corresponding conjugates having fewer polymeric molecules 
coupled thereto. 

Any available amino acid residues on the surface of the 
polypeptide, preferentially not being at or close to the 
10 functional site(s) , such as the active site(s) of enzymes, may in 
principle be subject to substitution and/or insertion to provide 
additional attachment groups. 

As will be described further below the location of the 
additional coupled polymeric molecules may be of importance for 
15 the reduction of the immune response and the percentage of 
xnaintained residual functional activity of the polypeptide itself. 

A conjugate of the invention may typically have from 1 to 25, 
preferentially 1 to 10 or more additional polymeric molecules 
coupled to the surface of the polypeptide in comparison to the 
20 number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 

However, the optimal number of attachment group to be added 
depends (at least partly) on the surface area (i.e. molecular 
weight) of the parent polypeptide to be shielded by the coupled 
25 polymeric molecules, and further off-course also the number of 
already available attachment groups on the parent polypeptide. 

h ) Removing Atta^ m^nt: groups 

in the case of enzymes or other polypeptides performing their 
30 function by interaction with a substrate or the like, polymeric 
molecules coupled to the polypeptide might be impeded by the 
interaction between the polypeptide and its substrate or the like, 
if they are coupled at or close to the functional site(s) (i.e. 
active site of enzymes). This will most probably cause reduced 

35 activity. . " 

in the case of enzymes having one or more polymeric molecules 
coupled at or close to the active site a substantial loss of 
residual enzymatic activity can be expected. Therefore, according 
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to the invention conjugates may be constructed to maintain a 

higher percentage of residual enzymatic activity in comparison to 

a corresponding conjugates prepared on the basis of the parent 

enzyme in question. This may be done by substituting and/or 
5 deleting attachment groups at or close to the active site, hereby 

increasing the substrate affinity by improving the accessibility 

of the substrate in the catalytic cleft. 

An enzyme-polymer conjugate of the invention may typically have 

from 1 to 25, preferably 1 to 10 fewer polymeric molecules coupled 
10 at or close to the active site in comparison to the number of 

polymeric molecules of a conjugate prepared on the basis of the 

corresponding parent polypeptide. 

As will be explained below "at or close to" the functional 

site(s) means that no polymeric molecule (s) should be coupled 
15 within 5 A, preferably 8 A, especially 10 A of the functional 

site(s) . 

Removal of attachment groups at or close to the functional 
site(s) of the polypeptide may advantageously be combined with 
addition of attachment groups in other parts of the surface of the 
20 polypeptide. 

The total number of attachment groups may this way be 
unchanged, increased or decreased. However the location (s) of the 
total number of attachment group (s) is (are) improved assessed by 
the reduction of the immune response and/ or percentage of 
25 maintained residual activity. Improved stability may also be 
obtained this way. 

The number of attachment group s 

Generally seen the number of attachment groups should be 
30 balanced to the molecular weight and/or surface area of the 
polypeptide. The more heavy the polypeptide is the more polymeric 
molecules should be coupled to the polypeptide to obtain 
sufficient shielding of the epitope (s) responsible for antibody 
formation. 

35 Therefore, if the parent polypeptide molecule is . relatively 
light (e.g. l to 35 kDa) it may be advantageous to increase the 
total number of coupled polymeric molecules (outside the 
functional site(s)) to a total between 4 and 20. 
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If the parent polypeptide molecules is heavier, for instance 35 
to 60 kDa, the number of coupled polymeric molecules (outside the 
functional site(s)) may advantageously be increased to 7 to 40, 
and so on. 

The ratio between the molecular weight (Mw) of the polypeptide 
in question and the number of coupled polymeric molecules 
considered to be suitable by the inventors is listed below in 
Table 1. 



10 Table 1 



Molecular weight of parent 
polypeptide (M w ) kDa 


Number of polymeric 
molecules coupled to the 
polypeptide 


1 to 35 


4-20 


35 to 60 


7-40 


60 to 80 


10-50 


80 to 100 


15-70 


more than 100 


more than 20 



a » T^nonse v ., maintained rP«1rtna1 enzymatic activity 

Especially for enzymes, in comparison to many other types of 
polypeptides, there is a conflict between reducing the immune 
15 response and maintaining a substantial residual enzymatic activity 
as the activity of enzymes are connected with interaction between 
a substrate and the active site often present as a cleft in the 

enzyme structure. 

Without being limited to any theory it is believed that the 
loss of enzymatic activity of enzyme-polymer conjugates might be a 
consequence of impeded access of the substrate to the active site 
in the form of spatial hindrance of the substrate by especially 
bulky and/or heavy polymeric molecules to the catalytic cleft. It 
might also, at least partly, be caused by disadvantageous minor 
25 structural changes of the 3D structure of the enzyme due to the 
stress made by the coupling of the polymeric molecules. 



20 



Maintained residual activity 

A polypeptide-polymer conjugates of the invention has a 
3 0 substantially maintained functional activity. 
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A "substantially" maintained functional activity is in the 
context of the present invention defined as an activity which is 
at least between 20% and 30%, preferably between 30% and 40%, more 
preferably between 40% and 60%, better from 60% up to 80%, even 
5 better from 80% up to about 100%, in comparison to the activity of 
the conjugates prepared on the basis of corresponding parent 
polypeptides . 

in the case of polypeptide-polymer conjugates of the 
invention where no polymeric molecules are coupled at or close to 
10 the functional site(s) the residual activity may even be up to 
100% or very close thereto. If attachment group(s) of the parent 
polypeptide is (are) removed from the functional site the activity 
might even be more than 100% in comparison to modified (i.e. 
polymer coupled) parent polypeptide conjugate. 
15 Pnciit.ion of nnuoled polyme ric molecules 

To obtain an optimally reduced immune response (a.e. 
immunogenic and allergenic response) the polymeric molecules 
coupled to the surface of the polypeptide in question should be 
located in a suitable distance from each other. 
20 in a preferred embodiment of the invention the parent 
polypeptide is modified in a manner whereby the polymeric 
m olecules are spread broadly over the surface of the polypeptide, 
in the case of the polypeptide in question has enzymatic activity 
it is preferred to have as few as possible, especially none, 
25 polymeric molecules coupled at or close to the area of the active 

Slt ln the present context "spread broadly over the surface of the 
polypeptide" means that the available attachment groups are 
located so that the polymeric molecules shield different parts of 
30 the surface, preferable the whole or close to the whole surface 
area away from the functional site(s), to make sure that 
epitope (s) are shielded and hereby not recognized by the immune 

system or its antibodies. 

The area of antibody-polypeptide interaction typically 
35 covers an area of 500 A 2 , as described by Sheriff et al. 

(1987), Proc. Natl. Acad. Sci. USA 84, p. 8075-8079. 500 A 
corresponds to a rectangular box of 25 k x 20 k or a circular 
region of radius 12.6 k. Therefore, to prevent binding of 
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antibodies to the epitope (s) to the polypeptide in question it 
is preferred to have a maximum distance between two attachment 

groups around 10 A. 

consequently, amino acid residues which are located xn excess 
of 10 A away from already available attachment groups are 
suitable target residues. If two or more attachment groups on the 
polypeptide are located very close to each other it will in most 
cases result in that only one polymeric molecule will be coupled. 

To ensure a minimal loss of functional activity it is preferred 
not to couple polymeric molecules at or close to the functxonal 
site(s). said distance depends at least partly on the business of 
the polymeric molecules to be coupled, as impeded access by the 
bulky polymeric molecules to the functional site is undesired. 
Therefore, the more bulky the polymeric molecules are the longer 
should the distance from the functional site to the coupled 
polymeric molecules be. 

To maintain a substantial functional activity of the 
polypeptide in question attachment groups located wxthxn 5 A, 
preferred 8 A, especially 10 A from such functional sxte(s) 
) should be left uncoupled and may therefore advantageously be 
removed or changed by mutation. Functional residues should 
normally not be mutated/ removed, even though they potentxally 
can be the target for coupling polymeric molecules. In said 
case it may thus be advantageous to chose a coupling chemistry 
5 involving different attachment groups. 

Further, to provide a polypeptide having coupled polymeric 
molecules at (a) known epito P e(s). recognizable by the immune 
system or close to said epitope(s) specific mutations at such 
sites are also considered advantageous according to the invention. 
0 If the position of the epitope(s) is(are) unknown xt xs 
advantageous to couple several or many polymeric molecules to the 
polypeptide . 

As also mentioned above it is preferred that said attachment 
groups are spread broadly over the surface. 



J5 



The atta chment group 

Virtually all ionized groups, such as the amino groups of 
Lysine residues, are located on the surface of the polypeptxde 
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molecule (see for instance Thomas E. Creighton, (1993) , 
"Proteins", W.H. Freeman and Company, New York). 

Therefore, the number of readily accessible attachment groups 
(e.g. amino groups) on a modified or parent polypeptide equals 
5 generally seen the number of Lysine residues in the primary 
structure of the polypeptide plus the N-terminus amino group. 

The chemistry of coupling polymeric molecules to amino groups 
are quite simple and well established in the art. Therefore, it is 
preferred to add and/ or remove Lysine residues (i.e. attachment 
10 groups) to/ from the parent polypeptide in question to obtain 
improved conjugates with reduced immunogenicity and/or 
allergenicity and/or improved stability and/or high percentage 
maintained functional activity. 

Polymeric molecules may also be coupled to the carboxylic 
15 groups (-C00H) of amino acid residues on the surface of the 
polypeptide. Therefore, if using carboxylic groups (including the 
C-terminal group) as attachment groups addition and/or removal of 
Aspartate and Glutamate residues may also be a suitable according 
to the invention. 

2 0 If using other attachment groups, such as -SH groups, they 
may be added and /or removed analogously. 

Substitution of the amino acid residues is preferred over 
insertion, as the impact on the 3D structure of the polypeptide 
normally will be less pronounced. 

25 Preferred substitutions are conservative substitutions. In the 
case of increasing the number of attachment groups the 
substitution may advantageously be performed at a location having 
a distance of 5 A, preferred 8 A, especially 10 A from the 
functional site(s) (active site for enzymes) . 

30 An example of a suitable conservative substitution to obtain 
an additional amino attachment group is a Arginine to Lysine 
substitution. Examples of conservative substitutions to obtain 
additional carboxylic attachment groups are Aspargine to 
Aspartate/ Glutamate or Glutamine to Aspartate/ Glutamate 

35 substitutions. To remove attachment groups a Lysine residue may be 
substituted with a Arginine and so on. 

The parent polypeptide 
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in the context of the present invention the term "polypept ides- 
includes proteins, peptides and/or enzymes for pharmaceutical or 
industrial applications. Typically the polypeptides in question 
have a molecular weight in the range between about 1 to 100 kDa, 
5 often 15 kDa and 100 kDa. 

Pharmaceutical polypeptides 

The term "pharmaceutical polypeptides" is defined as polypep- 
tides, including peptides, such as peptide hormones, proteins 
10 and/ or enzymes, being physiologically active when introduced into 
the circulatory system of the body of humans and/or animals. 

Pharmaceutical polypeptides are potentially immunogenic as they 
are introduced into the circulatory system. 

Examples of "pharmaceutical polypeptides" contemplated 
15 according to the invention include insulin, ACTH, glucagon, 
somatostatin, somatotropin, thymosin, parathyroid hormone, 
pigmentary hormones, somatomedin, erythropoietin, luteinizing 
hormone, chorionic gonadotropin, hypothalmic releasing factors, 
antidiuretic hormones, thyroid stimulating hormone, relaxin, 
20 interferon, thrombopoietin (TPO) and prolactin. 

Industrial polypeptides 

Polypeptides used for industrial applications often have an 

enzymatic activity. Industrial polypeptides (e.g. enzymes) are (in 
25 contrast to pharmaceutical polypeptides) not intended to be 

introduced into the circulatory system of the body. 

It is not very like that industrial polypeptides, such as 

enzymes used as ingredients in industrial compositions and/or 

products, such as detergents and personal care products, including 
30 cosmetics, come into direct contact with the circulatory system of 

the body of humans or animals, as such enzymes (or products 

comprising such enzymes) are not injected (or the like) into the 

bloodstream. 

Therefore, in the case of the industrial polypeptide the 
35 potential risk is respiratory allergy (i.e. IgE response) as a 
consequence of inhalation to polypeptides through the respiratory 
passage. 

In the context of the present invention "industrial polypep- 
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tides" are defined as polypeptides, including peptides, proteins 
and/or enzymes, which are not intended to be introduced into the 
circulatory system of the body of humans and/or animals. 

Examples of such polypeptides are polypeptides, especially 
5 enzymes, used in products such as detergents, household article 
products, agrochemicals, personal care products, such as skin care 
products, including cosmetics and toiletries, oral and dermal 
pharmaceuticals, composition use for processing textiles, 
compositions for hard surface cleaning, and compositions used for 
10 manufacturing food and feed etc. 

Enzymatic activity 

Pharmaceutical or industrial polypeptides exhibiting enzymatic 
activity will often belong to one of the following groups of 

15 enzymes including Oxidoreductases (E.C. 1, "Enzyme Nomenclature, 
(1992), Academic Press, Inc.), such as laccase and Superoxide 
dismutase (SOD); Transferases, (E.C. 2), such as transglutaminases 
(TGases) ; Hydrolases (E.C. 3), including proteases, especially 
subtilisins, and lipolytic enzymes; Isomerases (E.C. 5) , such as 

20 Protein disulfide Isomerases (PDI) . 

Hydrolases 

Proteolytic enzymes 

Contemplated proteolytic enzymes include proteases selected 
25 from the group of Aspartic proteases, such pepsins, Cysteine 

proteases, such as Papain, Serine proteases, such as subtilisins, 

or metallo proteases, such as Neutrase®. 

Specific examples of parent proteases include PD498 (WO 

93/24623 and SEQ ID NO. 2), Savinase® (von der Osten et al. , 
30 (1993), Journal of Biotechnology, 28, p. 55+, SEQ ID NO 3), 

Proteinase K (Gunkel et al. , (1989), Eur. J. Biochem, 179, p. 185- 

194), Proteinase R (Samal et al, (1990), Mol. Microbiol, 4, p. 

1789-1792), Proteinase T (Samal et al., (1989), Gene, 85, p. 329- 

333), Subtilisin DY (Betzel et al. (1993), Arch. Biophys, 302, no. 
35 2, p. 499-502), Lion Y (JP 04197182-A) , Rennilase® (Available from 

Novo Nordisk A/S) , JA16 (WO 92/17576) , Alcalase® (a natural 

subtilisin Carlberg variant) (von der Osten et al., (1993), 

Journal of Biotechnology, 28, p. 55+). 
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Lipolytic enzymes 

Contemplated lipolytic enzymes include Humicola lanuginosa 

lipases, e.g. the one described in EP 258 068 and EP 305 216 (See 
5 SEQ ID NO 6 below) , Humicola insolens, a Rhizomucor miehei lipase, 

e.g. as described in EP 238 023, Absidia sp. lipolytic enzymes (WO 

96/13578), a Candida lipase, such as a C. antarctica lipase, e.g. 

the C. antarctica lipase A or B described in EP 214 761, a 

Pseudomonas lipase such as a P. alcaligenes and P. 
10 pseudoalcaligenes lipase, e.g. as described in EP 218 272, a P. 

cepacia lipase, e.g. as described in EP 331 376, a Pseudomonas sp. 

lipase as disclosed in WO 95/14783, a Bacillus lipase, e.g. a B. 

subtilis lipase (Dartois et al., (1993) Biochemica et Biophysica 

acta 1131, 253-260), a B. stearothermophilus lipase (JP 64/744992) 
15 and a B. pumilus lipase (WO 91/16422). Other types of lipolytic 

include cutinases, e.g. derived from Pseudomonas mendocina as 

described in WO 88/09367, or a cutinase derived from Fusarium 

solani pisi {e.g. described in WO 90/09446) . 

20 Oxidoreductases 

Laccases 

Contemplated laccases include Polyporus pinisitus laccase (WO 
96/00290) , Myceliophthora laccase (WO 95/33836) , Schytalidium 
laccase (WO 95/338337) , and Pyricularia oryzae laccase (Available 
25 from Sigma J . 

Peroxidase 

Contemplated peroxidases include B. pumilus peroxidases (WO 
91/05858) , Myxococcaceae peroxidase (WO 95/11964) , Coprinus 
3 0 cinereus (WO 95/10602) and Arthromyces ramosus peroxidase 
(Kunishima et al. (1994), J. Mol. Biol. 235, p. 331-344). 

Transferases 

Transglutaminases 

3 5 Suitable transferases include any transglutaminases disclosed 
in WO 96/06931 (Novo Nordisk A/S) and WO 96/22366 (Novo Nordisk 
A/S) . 
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Zsom rases 

Protein Disulfide Isomerase 

Without being limited thereto suitable protein disulfide 
isomerases include PDIs described in WO 95/01425 (Novo Nordisk 
5 A/S) . 

The polymeric molecule 

The polymeric molecules coupled to the polypeptide may be any 
suitable polymeric molecule, including natural and synthetic homo- 

10 polymers, such as polyols (i.e. poly-OH) , polyamines (i.e. poly- 
NH2) and polycarboxyl acids (i.e. poly-COOH) , and further hetero- 
polymers i.e. polymers comprising one or more different coupling 
groups e.g. a hydroxy 1 group and amine groups. 

Examples of suitable polymeric molecules include polymeric 

15 molecules selected from the group comprising polyalkylene oxides 
(PAO) , such as polyalkylene glycols (PAG) , including polyethylene 
glycols (PEG) , methoxypolyethylene glycols (ntPEG) and polypropylen 
glycols, PEG-glycidyl ethers (Epox-PEG) , PEG-oxycarbonylimidazole 
(CDI-PEG) , Branced PEGs, poly-vinyl alcohol (PVA) , poly- 

20 carboxylates , poly-(vinylpyrolidone) , poly-D,L-amino acids, 
polyethylene-co-maleic acid anhydride, polystyrene-co-malic acid 
anhydrid, dextrans including carboxymethy 1-dextrans , heparin, 
homologous albumin, celluloses, including methylcellulose, 
carboxymethylcellulose , ethylcellulose, hydroxyethylcellulose 

25 carboxyethylcellulose and hydroxypropylcellulose, hydrolysates of 
chitosan, starches such as hydroxyethyl-straches and hydroxy 
propyl-starches , glycogen, agaroses and derivates thereof, guar 
gum, pullulan, inulin, xanthan gum, carrageenin, pectin, alginic 
acid hydrolysates and bio-polymers. 

30 Preferred polymeric molecules are non-toxic polymeric molecules 
such as (m) polyethylene glycol ( (m) PEG) which further requires a 
relatively simple chemistry for its covalently coupling to 
attachment groups on the enzyme ' s surface . 

Generally seen polyalkylene oxides (PAO) , such as polyethylene 

35 oxides, such as PEG and especially mPEG, are the preferred 
polymeric molecules, as these polymeric molecules, in comparison 
to polysaccharides such as dextran, pullulan and the like, have 
few reactive groups capable of cross- linking. 
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Even though all of the above mentioned polymeric molecules may 
be used according to the invention the methoxypolyethylene glycols 
(mPEG) may advantageously be used. This arise from the fact that 
methoxyethylene glycols have only one reactive end capable of 
5 conjugating with the enzyme. Consequently, the risk of cross- 
linking is less pronounced. Further, it makes the product more 
homogeneous and the reaction of the polymeric molecules with the 
enzyme easier to control. 

10 Preparation of enzyme variants 

Enzyme variants to be conjugated may be constructed by any 
suitable method. A number of methods are well established in 
the art. For instance enzyme variants according to the 
invention may be generated using the same materials and methods 

15 described in e.g. WO 89/06279 (Novo Nordisk A/S) , EP 130,756 
(Genentech) , EP 479,870 (Novo Nordisk A/S), EP 214,435 
(Henkel) , WO 87/04461 (Amgen) , WO 87/05050 (Genex) , EP appli- 
cation no. 87303761 (Genentech), EP 260,105 (Genencor) , WO 
88/06624 (Gist-Brocades NV) , WO 88/07578 (Genentech), WO 

20 88/08028 (Genex), WO 88/08033 (Amgen), WO 88/08164 (Genex), 
Thomas et al. (1985) Nature, 318 375-376; Thomas et al. (1987) 
J. Mol. Biol., 193, 803-813; Russel and Fersht (1987) Nature 
328 496-500. 

25 Generation of site directed mutations 

Prior to mutagenesis the gene encoding the polypeptide of 
interest must be cloned in a suitable vector. Methods for 
generating mutations in specific sites is described below. 

Once the polypeptide encoding gene has been cloned, and 

30 desirable sites for mutation identified and the residue to 
substitute for the original ones have been decided, these 
mutations can be introduced using synthetic oligonucleotides. 
These oligonucleotides contain nucleotide sequences flanking the 
desired mutation sites; mutant nucleotides are inserted during 

35 oligo-nucleotide synthesis. In a preferred method, Site-directed 
mutagenesis is carried out by SOE-PCR mutagenesis technique 
described by Kammann et al. (1989) Nucleic Acids Research 17(13), 
5404, and by Sarkar G. and Sommer, S.S. (1990) ; Biotechniques 8, 
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404-407. 

Activation of polymers 

If the polymeric molecules to be conjugated with the 
5 polypeptide in question are not active it must be activated by the 
use of a suitable technique. It is also contemplated according to 
the invention to couple the polymeric molecules to the polypeptide 
through a linker. Suitable linkers are well-known to the skilled 
person . 

10 Methods and chemistry for activation of polymeric molecules 

as well as for conjugation of polypeptides are intensively 
described in the literature. Commonly used methods for activation 
of insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glutaraldehyde, biepoxides, 

15 epichlorohydrin, divinylsulfone, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 
immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking" , CRC Press, Boca Raton; G.T. Hermanson et al., 

20 (1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 
polymers but are also applicable to activation of soluble polymers 
e.g. periodate, trichlorotriazine, sulf onylhalides, 

divinylsulfone, carbodiimide etc. The functional groups being 

25 amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 
considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

30 In the following a number of suitable polymer activation 
methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of poly- 
peptides may be performed with the aid of diimide and for example 

35 amino-PEG or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. 
Soc, 98, 289-291) or diazoacetate/amide (Wong et al., (1992), 
"Chemistry of Protein Conjugation and Crosslinking", CRC Press). 
Coupling polymeric molecules to hydroxy groups are generally 
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very difficult as it must be performed in water. Usually 
hydrolysis predominates over reaction with hydroxyl groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
5 disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 
selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 
10 Techniques involving coupling electrophilically activated 

PEGs to the amino groups of Lysines may also be useful. Many of 
the usual leaving groups for alcohols give rise to an amine 
linkage. For instance, alkyl sulfonates, such as tresylates 
(Nilsson et al., (1984), Methods in Enzyraology vol. 104, Jacoby, 
15 W. B., Ed., Academic Press: Orlando, p. 56-66; Nilsson et al. , 
(1987), Methods in Enzymology vol. 135; Mosbach, K. , Ed.; Academic 
Press: Orlando, pp. 65-79; Scouten et al. , (1987), Methods in 
Enzymology vol. 135, Mosbach, K. , Ed., Academic Press: Orlando, 
1987; pp 79-84; Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 
20 93, pp. 4217-4219), mesylates (Harris, (1985), supra; Harris et 
al., (1984), J. Polym. Sci. Polym. Chem. Ed. 22, pp 341-352), aryl 
sulfonates like tosylates, and para-nitrobenzene sulfonates can be 
used. 

organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
25 converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with nucleo- 
philes like amino groups in polypeptides allow stable linkages to 
be formed between polymer and polypeptide. In addition to high 
conjugation yields, the reaction conditions are in general mild 
30 (neutral or slightly alkaline pH, to avoid denaturation and little 
or no disruption of activity) , and satisfy the non-destructive re- 
quirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more un- 
stable decomposing into PEG, dioxane, and sulfonic acid (Zalipsky, 
35 (1995), Bioconjugate Chem., 6, 150-165). Epoxides may also been 
used for creating amine bonds but are much less reactive than the 
above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise 
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to carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem. , 15, p. 100-114; Monfardini et al. , (1995), Biocon- 
5 jugate Chem., 6, 62-69, with imidazole (Allen et al. , (1991), 
Carbohydr. Res., 213, pp 309-319), with para-nitrophenol , DMAP (EP 
632 082 Al, (1993), Looze, Y. ) etc. The derivatives are usually 
made by reacting the chloroformate with the desired leaving group. 
All these groups give rise to carbamate linkages to the peptide. 

10 Furthermore, isocyanates and isothiocyanates may be employed 

yielding ureas and thioureas, respectively. 

Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5,349,001, (1994), Greenwald et al.). The reactivity of these com- 

15 pounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 
(1992), Zalipsky). This group may be activated with N-hydroxy suc- 

2 0 cinimide . 

Furthermore, a special linker can be introduced. The oldest 
being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
25 375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
yields a very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
30 Greenwald, R. B.) thus introducing an additional amide linkage. 

As some peptides do not comprise many Lysines it may be 
advantageous to attach more than one PEG to the same Lysine. This 
can be done e.g. by the use of l,3-diamino-2-propanol. 

PEGs may also be attached to the amino-groups of the enzyme 
35 with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine 
residues may also be used as the backbone. 

The coupling technique used in the examples is the N- 
succinimidyl carbonate conjugation technique descried in WO 
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90/13590 (Enzon) . 

Method for preparing improved conjugates 

It is also an object of the invention to provide a method for 
5 preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
10 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and /or 

ii) substituting or deleting one or more amino acid residues 
15 selected in step b) at or close to the functional site(s), 

d) coupling polymeric molecules to the mutated polypeptide. 

Step a) Identifying amino acid residues located on the surface of 
the parent polypeptide 

20 

3-dimensional structure ( 3D- structure) 

To perform the method of the invention a 3-dimensional 

structure of the parent polypeptide in question is required. 

This structure may for example be an X-ray structure, an NMR 
25 structure or a model-built structure. The Brookhaven Databank 

is a source of X-ray- and NMR-structures . 

A model-built structure may be produced by the person 

skilled in the art if one or more 3D-structure(s) exist (s) of 

homologous polypeptide (s) sharing at least 30% sequence 
30 identity with the polypeptide in question. Several software 

packages exist which may be employed to construct a model 

structure. One example is the Homology 95.0 package from 

Biosym. 

Typical actions required for the construction of a model 
35 structure are: alignment of homologous sequences for which 3D- 
structures exist, definition of Structurally Conserved Regions 
(SCRs) , assignment of coordinates to SCRs, search for 
structural fragments/ loops in structure databases to replace 
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Variable Regions, assignment of coordinates to these regions, 
and structural refinement by energy minimization. Regions 
containing large inserts (£3 residues) relative to the known 
3D-structures are known to be quite difficult to model, and 
5 structural predictions must be considered with care. 

Having obtained the 3D-structure of the polypeptide in 
question, or a model of the structure based on homology to 
known structures, this structure serves as an essential 
prerequisite for the fulfillment of the method described below. 

10 

Step b) Selection of target amino acid residue s for mutation 
Target amino acid residues to be mutated are according to 
the invention selected in order to obtain additional or fewer 
attachment groups, such as free amino groups (-NH 2 ) or free 
15 carboxylic acid groups (-COOH) , on the surface of the 

polypeptide and/or to obtain a more complete and broadly spread 
shielding of the epitope (s) on the surface of the polypeptide. 

Conservative substitution 
20 It is preferred to make conservative substitutions in the 
polypeptide, as conservative substitutions secure that the 
impact of the mutation on the polypeptide structure is limited. 

In the case of providing additional amino groups this may be 
done by substitution of Arginine to Lysine, both residues being 

2 5 positively charged, but only the Lysine having a free amino 

group suitable as an attachment groups. 

In the case of providing additional carboxylic acid groups 
the conservative substitution may for instance be an Aspargine 
to Aspartic acid or Glutamine to Glutamic acid substitution. 

3 0 These residues resemble each other in size and shape, except 

from the carboxylic groups being present on the acidic 
residues. 

In the case of providing fewer attachment groups, e.g. at or 
close to the active site, a Lysine may be substituted with a 
35 Arginine, and so on. 

Which amino acids to substitute depends in principle on the 
coupling chemistry to be applied. 
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Non-conservative substitution 

The mutation may also be on target amino acid residues which 
are less /non-conservative. Such mutation is suitable for 
obtaining a more complete and broadly spread shielding of the 
5 polypeptide surface than can be obtained by the conservative 
substitutions . 

The method of the invention is first described in general 
terms, and subsequently using specific examples. 

Note the use of the following terms: 
10 Attachment_residue : residue (s) which can bind polymeric 

molecules, e.g. Lysines (amino group) or Aspartic/Glutamic 
acids (carboxylic groups) . N- or C-terminal amino/ carboxy lie 
groups are to be included where relevant. 

Mutation_residue: residue (s) which is to be mutated, e.g. 
15 Arginine or Aspargine/Glutamine. 

Essential_catalytic_residues: residues which are known to be 
essential for catalytic function, e.g. the catalytic triad in 
Serine proteases. 

Solvent_exposed_residues: These are defined as residues which 
20 are at least 5% exposed according to the BIOSYM/ INSIGHT 

algorithm found in the module Homology 95.0. The sequence of 
commands are as follows: 

Homology=>ProStat=>Access_Surf =>Solv_Radius 1.4; Heavy atoms 
only; Radii source VdW; Output: Fractional Area; Polarity 
25 source: Default. The file f ilename_area. tab is produced. Note: 
For this program to function properly all water molecules must 
first be removed from the structure. 
It looks for example like: 
# PD498FINALMODEL 



30 


# residue 


area 




TRP_1 


136.275711 




SER_2 


88.188095 




PRO_3 


15.458788 




ASN_4 


95.322319 


35 


ASP_5 


4.903404 - 




PRO_6 


68.096909 




TYR_7 


93.333252 




TYR 8 


31.791576 
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SER_9 95.983139 
. . continued 

1. Identification of residues which are more than 10 A away 

5 from the closest attachment_residue, and which are located at 
least 8 A away from essential_catalytic_residues. This residue 
subset is called REST, and is the primary region for 
conservative mutation_residue to attachment_residue 
substitutions . 

10 

2. Identification of residues which are located in a 0-5 A 
shell around subset REST, but at least 8 A away from 
essential_catalytic_residues. This residue subset is called 
SUB5B. This is a secondary region for conservative 

15 mutation_residue to attachment_residue substitutions, as a 

ligand bound to an attachment_residue in SUB5B will extend into 
the REST region and potentially prevent epitope recognition. 

3. Identification of solvent_exposed mutation_residues in REST 
20 and SUB5B as potential mutation sites for introduction of 

attachment_residues . 

4. Use BIOSYM/ INSIGHT • s Biopolymer module and replace residues 
identified under action 3 . 

25 

5. Repeat 1-2 above producing the subset RESTx. This subset 
includes residues which are more than 10 A away from the 
nearest attachment_residue, and which are located at least ft A 
away from essential catalytic residues. 

30 

6. Identify solvent_exposed_residues in RESTx. These are 
potential sites for less/non-conservative mutations to 
introduce atttachment_residues. 

35 

Step Substituting, inserting or deleting amino acid residues 

The mutation (s) performed in step c) may be performed by 
standard techniques well known in the art, such as site-directed 



* . * 
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mutagenesis (see, e.g., Sambrook et al. (1989), Sambrook et al., 
Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. 

A general description of nucleotide substitution can be found 
in e.g. Ford et al . , 1991, Protein Expression and Purification 2, 
5 p. 95-107. 



Step d) Coupling polymeric molecules to the modified parent enzyme 
Polypepti de-polymer conjugates of the invention may be 
prepared by any coupling method known in the art including the 
10 above mentioned techniques. 

Coupling of polymeric molecules to the polypeptide in question 

If the polymeric molecules to be conjugated with the 
polypeptide are not active it must be activated by the use of a 

15 suitable method. The polymeric molecules may be coupled to the 
polypeptide through a linker. Suitable linkers are well known to 
the skilled person. 

Methods and chemistry for activation of polymeric molecules as 
well as for conjugation of polypeptides are intensively described 

20 in the literature. Commonly used methods for activation of 
insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glutaraldehyde, biepoxides, 
epichlorohydr in , divinylsulf one, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 

25 immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking", CRC Press, Boca Raton; G.T. Hermanson et al., 
(1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 

30 polymers but are also applicable to activation of soluble polymers 
e.g. periodate, trichlorotriazine, sulfonylhalides, 

divinylsulf one, carbodiimide etc. The functional groups being 
amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 

35 considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

In the following a number of suitable polymer activation 
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methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of enzymes 
can be performed with the aid of diimide and for example amino-PEG 
5 or hydrazino-PEG (Pollak et al. f (1976), J. Amr. Chem. Soc, 98, 
289-291) or diazoacetate/amide (Wong et al., (1992), "Chemistry of 
Protein Conjugation and Crosslinking" , CRC Press). 

Coupling polymeric molecules to hydroxy groups are generally 
very difficult as it must be performed in water. Usually 

10 hydrolysis predominates over reaction with hydroxyl groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the artho- pyridyl 
disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 

15 selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

Techniques involving coupling electrophilically activated PEGs 
to the amino groups of Lysines are also be useful. Many of the 

20 usual leaving groups for alcohols give rise to an amine linkage. 
For instance, alkyl sulfonates, such as tresylates (Nilsson et 
al., (1984), Methods in Enzymology vol. 104, Jacoby, W. B., Ed., 
Academic Press: Orlando, p. 56-66; Nilsson et al. , (1987), Methods 
in Enzymology vol. 135; Mosbach, K. , Ed.; Academic Press: Orlando, 

25 pp. 65-79; Scouten et al. , (1987), Methods in Enzymology vol. 135, 
Mosbach, K. , Ed., Academic Press: Orlando, 1987; pp 79-84; 
Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 93, pp. 4217-4- 
219), mesylates (Harris, (1985), supra ; Harris et al., (1984), J. 
Polym. Sci. Polym. Chem. Ed. 22, pp. 341-352), aryl sulfonates 

30 like tosylates, and para-nitrobenzene sulfonates can be used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with 
nucleophiles like amino groups in polypeptides allow stable 

35 linkages to be formed between polymer and polypeptide. In addition 
to high conjugation yields, the reaction conditions are in general 
mild (neutral or slightly alkaline pH, to avoid denaturation and 
little or no disruption of activity) , and satisfy the non- 
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destructive requirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more 
unstable decomposing into PEG, dioxane, and sulfonic acid 
(Zalipsky, (1995), Bioconjugate Chem., 6, 150-165). Epoxides may 
5 also been used for creating amine bonds but are much less reactive 
than the above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise to 
carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
10 patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem. , 15, p. 100-114; Monfardini et al., (1995), 
Bioconjugate Chem., 6, 62-69, with imidazole (Allen et al., 

(1991) , Carbohydr. Res., 213, pp 3 09-319), with para-nitrophenol, 
DMAP (EP 632 082 Al, (1993), Looze, Y.) etc. The derivatives are 

15 usually made by reacting the chloroformate with the desired 
leaving group. All these groups give rise to carbamate linkages to 
the peptide. 

Furthermore, isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 
20 Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5,349,001, (1994), Greenwald et al.). The reactivity of these 
compounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
25 also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 

(1992) , Zalipsky) . This group may be activated with N-hydroxy 
succinimide. 

Furthermore, a special linker can be introduced. The oldest 
30 being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
35 yields a Very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
Greenwald, R. B.) thus introducing an additional amide linkage. 
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As some peptides do not comprise many Lysines it may be advan- 
tageous to attach more than one PEG to the same Lysine. This can 
be done e.g. by the use of l,3-diaraino-2-propanol. 

PEGs may also be attached to the amino-groups of the enzyme 
5 with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine 
residues may also be used as the backbone. 

Addition of attachment groups 

Specific examples of PD498 variant-SPEG conjugates 
10 A specific example of a protease is the parent PD498 (WO 
93/24623 and SEQ ID NO. 2) . The parent PD498 has a molecular 
weight of 29 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from 


the 


Arginine 


Lysine 


active site 








0-5 A 




1 




5-10 A 






10-15 A 


5 


6 


15-20 A 


2 


3 


20-25 A 




1 


3 


total 




9 


12 



15 The inventors examined which parent PD498 sites on the surface 
may be suitable for introducing additional attachment groups. 

A. Suitable conservative Arginine to Lysine substitutions in 
parent PD498 may be any of R51K, R62K, R121K, R169K, R250K, R28K, 
R190K. 

20 B. Suitable non-conservative substitutions in parent PD498 may 
be any of P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, 
N65K, G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K, 
S220K, Y221K, G262K. 

As there is no Lysine residues at or close to the active site 

25 there is no need for removing any attachment group. 

PD498 variant-SPEG conjugates may be prepared using any of the 
above mentioned PD498 variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 

3 0 described below. 
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Removal of attachm nt groups 

Specific examples of BPN" variant-SPEG conjugates 

A specific example of a protease having an attachment group in 
5 the active site is BPN" which has 11 attachment groups (plus an N- 
terminal amino group) : BPN' has a molecular weight of 28 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from 


Arginine 


Lysine 


the active site 






0-5 A 




1 


5-10 A 






10-15 A 


1 


4 


15-20 A 


1 


4 


20-25 A 




2 


total 


2 


11 



10 The Lysine residue located within 0-5 A of the active site can 

according to the invention advantageously be removed. Specifically 

this may be done by a K94R substitution. 

BPN" variant-SPEG conjugates may be prepared using the above 

mentioned BPN" variant as the starting material by any conjugation 
15 technique known in the art for coupling polymeric molecules to 

amino groups on the enzyme. 

Addition and removal of attachment groups 

Specific example of Savinase®-SPEG -conjugates 
20 As described in Example 2 parent Savinase® (von der Osten et 

al. , (1993), Journal of Biotechnology, 28, p. 55+ and SEQ ID NO. 

3) may according to the invention have added a number of amino 

attachment groups to the surface and removed an amino attachment 

group close to the active site. 
25 Any of the following substitutions in the parent Savinase® 

are sites for mutagenesis: R10K, R19K, R45K, R145K, R170K, 

R186K and R247K. 

The substitution K94R are identified as a mutation suitable 

for preventing attachment of polymers close to active site. 
30 Savinase® variant-SPEG conjugates may be prepared using any of 
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the above mentioned Savinase® variants as the starting material by 
any conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. 

5 Addition of attachment groups 

A specific examples of Humicola lanuginosa lipase variants-SPEG 
conjugates 

Specific examples of lipase variants with reduced 
immunogenicity using the parent Huminocal lanuginosa DSM 4109 
10 lipase (see SEQ ID No 6) as the backbone for substitutions are 
listed below. 

The parent unmodified Humicola lanuginosa lipase has 8 
attachment groups including the N-terminal NH2 group and a 
molecular weight of about 29 kDa. 
15 A. Suitable conservative Arginine to Lysine substitutions in the 
parent lipase may be any of R133K, R139K, R160K, R179K, R209K, 
R118K and R125K. 

Suitable non-conservative substitutions in the parent lipase 
may be any of: 

20 A18K,G31K,T32K,N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K, 
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K, 
V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 

Further suitable non- conservative substitution in the Humicola 
lanuginosa lipase include: E87K or D254K. 

2 5 Lipase variant-SPEG conjugates may be prepared using any of the 

above mentioned lipase variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 
described below. 

30 In Example 12 below is it shown that a conjugate of the 
Humicola lanuginosa lipase variant with a E87K+D254K substitutions 
coupled to S-PEG 15,000 has reduced immunogenic response in Balb/C 
mice in comparison to the corresponding parent unmodified enzyme. 

3 5 Immunogenic ity and Allergenicity 

"Immunogenicity" is a wider term than "antigenicity" and 
"allergenicity" , and expresses the immune system's response to the 
presence of foreign substances. Said foreign substances are called 
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immunogens, antigens and allergens depending of the type of immune 
response the elicit. 

An "immunogen" may be defined as a substance which, when intro- 
duced into circulatory system of animals and humans, is capable of 
5 stimulating an immunologic response resulting in formation of 
immunoglobulin . 

The term "antigen" refers to substances which by themselves are 
capable of generating antibodies when recognized as a non-self 
molecule . 

10 Further, an "allergen" may be defined as an antigen which may 
give rise to allergic sensitization or an allergic response by IgE 
antibodies (in humans, and molecules with comparable effects in 
animals) . 

15 Assessment of immunoqencity 

Assessment of the immunogenicity may be made by injecting 
animal subcutaneous ly to enter the immunogen into the circulation 
system and comparing the response with the response of the 
corresponding parent polypeptide. 

20 The "circulatory system" of the body of humans and animals 
means, in the context of the present invention, the system which 
mainly consists of the heart and blood vessels. The heart delivers 
the necessary energy for maintaining blood circulation in the 
vascular system. The circulation system functions as the 

25 organism's transportation system, when the blood transports O2, 
nutritious matter, hormones, and other substances of importance 
for the cell regulation into the tissue. Further the blood removes 
CO2 from the tissue to the lungs and residual substances to e.g. 
the kidneys. Furthermore, the blood is of importance for the 

30 temperature regulation and the defence mechanisms of the body, 
which include the immune system. 

A number of in vitro animal models exist for assessment of the 
immunogenic potential of polypeptides. Some of these models give a 
suitable basis for hazard assessment in man. Suitable models 

35 include a mice model. 

This model seek to identify the immunogenic response in the 
form of the IgG response in Balb/C mice being injected 
subcutaneous ly with modified and unmodified polypeptides. 
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Also other animal models can be used for assessment of the 
immunogenic potential. 

A polypeptide having "reduced immunogenicity" according to the 
invention indicates that the amount of produced antibodies, e.g. 
5 immunoglobulin in humans, and molecules with comparable effects in 
specific animals, which can lead to an immune response, is 
significantly decreased, when introduced into the circulatory 
system, in comparison to the corresponding parent polypeptide. 

For Balb/C mice the IgG response gives a good indication of the 
10 immunigenic potential of polypeptides. 

Assessment of allergenicity 

Assessment of allergenicity may be made by inhalation tests, 
comparing the effect of intratracheal ly (into the trachea) 
15 administrated parent enzymes with the corresponding modified 
enzymes according to the invention. 

A number of in vivo animal models exist for assessment of the 
allegenicity of enzymes. Some of these models give a suitable 
basis for hazard assessment in man. Suitable models include a 
20 guinea pig model and a mouse model. These models seek to identify 
respiratory allergens as a function of elicitation reactions 
induced in previously sensitised animals. According to these 
models the alleged allergens are introduced intratracheal ly into 
the animals. 

25 A suitable strain of guinea pigs, the Dunkin Hartley strain, do 
not as humans, produce IgE antibodies in connection with the 
allergic response. However, they produce another type of antibody 
the IgGIA and IgGIB (see e.g. Prento, ATLA, 19, p. 8-14, 1991), 
which are responsible for their allergenic response to inhaled 
30 polypeptides including enzymes. Therefore, when using the Dunkin 
Hartley animal model, the relative amount of IgGIA and IgGIB is a 
measure of the allergenicity level. 

The Balb/C mice strain is suitable for intratracheal exposure. 
Balb/C mice produce IgE as the allergic response. 
- 35 More details on assessing respiratory allergens in guinea pigs 
and mice is described by Kimber et al.,(1996), Fundamental and 
Applied Toxicology, 33, p. 1-10. 
Other animals such as rats, rabbits etc. may also be used for 
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comparable studies. 
Composition 

The invention relates to a composition comprising a 
5 polypeptide-polymer conjugate of the invention. 

The composition may be a pharmaceutical or industrial 
composition . 

The composition may further comprise other polypeptides, 
proteins or enzymes and/or ingredients normally used in e.g. 

10 detergents , including soap bars, household articles, 
agrochemicals , personal care products, including skin care 
compositions, cleaning compositions for e.g. contact lenses, oral 
and dermal pharmaceuticals, composition use for treating textiles, 
compositions used for manufacturing food, e.g. baking, and feed 

15 etc. 

Use of the polvpeptide-polvmer conjugate 

The invention also relates to the use of the method of the 
invention for reducing the immune response of polypeptides. 
20 It is also an object of the invention to use the polypeptide- 

polymer conjugate of the invention to reduce the allergenicity of 
industrial products, such as detergents, such as laundry, disk 
wash and hard surface cleaning detergents, and food or feed 
products . 

25 

MATERIAL AND METHODS 
Materials 

Enzymes ; 

PD498: Protease of subtilisin type shown in WO 93/24623. The 
3 0 sequence of PD498 is shown in SEQ ID NO. 1 and 2. 
Savinase® (Available from Novo Nordisk A/S) 

Humicola lanuginosa lipase: Available from Novo Nordisk as 
lipolase® and is further described in EP 305,216. The DNA and 
protein sequence is shown in SEQ ID NO 5 and 6, respectively. 
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Strains: 

B . subtilis 309 and 147 are variants of Bacillus lentus, 
deposited with the NCIB and accorded the accession numbers NCIB 
5 10309 and 10147, and described in US Patent No. 3,723,250 
incorporated by reference herein. 

E. coli MC 1000 (M.J. Casadaban and S.N. Cohen (1980); J . 
Mol. Biol. 138 179-207), was made r~,m + by conventional methods 
and is also described in US Patent Application Serial No. 
10 039,298. 

Vectors : 

pPD498: E. coli - B. subtilis shuttle vector (described in 
US patent No. 5,621,089 under section 6.2.1.6) containing the 
15 wild-type gene encoding for PD498 protease (SEQ ID NO. 2). The 
same vector is use for mutagenesis in E. coli as well as for 
expression in B . subtilis. 

General molecular biology methods: 

20 Unless otherwise mentioned the DNA manipulations and 

transformations were performed using standard methods of 
molecular biology (Sambrook et al. (1989) Molecular cloning: A 
laboratory manual, Cold Spring Harbor lab., Cold Spring Harbor, 
NY; Ausubel, F. M. et al. (eds.) "Current protocols in 

25 Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R. , 
and Cutting, S. M. (eds.) "Molecular Biological Methods for 
Bacillus". John Wiley and Sons, 19-90). 

Enzymes for DNA manipulations were used according to the 
specifications of the suppliers. 

30 

Materials, chemicals and solutions: 

Horse Radish Peroxidase labeled anti-rat-Ig (Dako, DK, P162, # 
031; dilution 1:1000). 
35 Mouse anti-rat IgE (Serotec MCA193; dilution 1:200). 
Rat anti-mouse IgE (Serotec MCA419; dilution 1:100). 
Biotin-labeled mouse anti-rat IgGl monoclonal antibody (Zymed 03- 
9140; dilution 1:1000) 
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Biotin-labeled rat anti-mouse IgGl monoclonal antibody (Serotec 
MCA336B; dilution 1:1000) 

Streptavidin-horse radish peroxidase (Kirkegard & Perry 14-30-00; 

dilution 1:1000) . 
5 covaLink NH 2 plates (Nunc, Cat# 459439) 
• cyanuric chloride (Aldrich) 
Acetone (Merck) 

Rat anti-Mouse IgGl, biotin (SeroTec, Cat# MCA336B) 
Streptavidin, peroxidase (KPL) 
10 Ortho-Phenylene-diamine (OPD) (Kem-en-Tec) 
H 2 0 2 , 30% (Merck) 
Tween 20 (Merck) 
Skim Milk powder (Difco) 
H 2 S0 4 (Merck) 

15 

Buffers and Solutions: 

Carbonate buffer (0.1 M, pH 10 (1 liter)) Na 2 C0 3 10.60 g 

PBS ( P H 7.2 (1 liter)) NaCl 8.00 g 

kci °- 20 g 

20 K 2 HP0 4 1.04 g 

KH 2 P0 4 0.32 g 

Washing buffer PBS, 0.05% (v/v) Tween 20 
Blocking buffer PBS, 2% (wt/v) Skim Milk powder 

Dilution buffer PBS, 0.05% (v/v) Tween 20, 0.5% (wt/v) Skim Milk 
25 powder 

Citrate buffer (0.1M, P H 5.0-5.2 (1 liter) )NaCitrate 20.60 g 

Citric acid 6.30 g 

ftntivation of Co^ T.inTc plates: 

• Make a fresh stock solution of 10 mg cyanuric chloride per ml 
30 acetone. 

• Just before use, dilute the cyanuric chloride stock solution 
into PBS, while stirring, to a final concentration of Img/ml. 

• Add 100 ml of the dilution to each well of the CovaLink NH2 
plates, and incubate for 5 minutes at room temperature. 

3 5 • Wash 3 times with PBS. 

. Dry the freshly prepared activated plates at 50°C for 30 

minutes . 

• immediately seal each plate with sealing tape. 
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• Preactivated plates can be stored at room temperature for 3 
weeks when kept in a plastic bag. 



Sodium Borate, borax (Sigma) 
5 3,3-Dimethyl glutaric acid (Sigma) 
CaCl 2 (Sigma) 

Tresyl chloride (2 ,2 ,2-trif louroethansulfonyl chloride) (Fluka) 
l-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC) (Fluka) 
W-Hydroxy succinimide (Fluka art. 56480) ) 
10 Phosgene (Fluka art. 79380) 
Lactose (Merck 7656) 

PMSF (phenyl methyl sulfonyl flouride) from Sigma 
Succinyl-Alanine-Alanine-Proline-Phenylalanine-para-nitroanilide 

(Suc-AAPF-pNP) Sigma no. S-7388, Mw 624.6 g/mole. 

15 

Colour inq substrate: 

OPD: o-phenylene-diamine, (Kementec cat no. 4260) 



Test Animals: 

20 Dunkin Hartley guinea pigs (from Charles River, DE) 

Female Balb/C mice (about 20 grams) purchased from Bomholdtgaard, 
Ry , Denmark . 



Equipment : 
25 XCEL II (Novex) 

ELISA reader (UVmax, Molecular Devices) 
HPLC (Waters) 
PFLC (Pharmacia) 

Superdex-75 column, Mono-Q, Mono S from Pharmacia, SW. 
3 0 SLT: Fotometer from SLT Lablnstruments 

Size-exclusion chromatograph (Spherogel TSK-G2000 SW) . 
Size-exclusion chromatograph (Superdex 200, Pharmacia, SW) 
Amicon Cell 



35 Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA 
manipulations, such as e.g. restriction endonucleases, ligases 
etc., are obtained from New England Biolabs. Inc. 
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M thods 

ELISA procedure for determination of IgG^ positive guinea pigs 

ELISA microtiter plates are coated with rabbit anti-PD498 
5 1:8000 in carbonate buffer and incubated over night at 4°C. The 
next day the plates is blocked with 2% BSA for 1 hour and washes 3 
times with PBS Tween 20. 

1 ug/ml PD498 is added to the plates and incubated for 1 hour, 
then washed 3 times with PBS Tween 20. 
10 All guinea pig sera samples and controls are applied to the 

ELISA plates with 2 jj.1 sera and 98 ul PBS, incubated for 1 hour 
and washed 3 times with PBS Tween 20. 

Then goat anti-guinea pig IgGi (1:4000 in PBS buffer (Nordic 
Immunology 44-682)) is applied to the plates, incubated for 1 hour 
15 and washed with PBS tween 20. 

Alkaline phosphatase marked rabbit anti-goat 1:8000 (Sigma 
A4187) is applied and incubated for 1 hour, washed 2 times in PBS 
Tween20 and 1 time with diethanol amine buffer. 

The marked alkaline phosphatase is developed using p- 
20 nitrophenyl phosphate for 3 0 minutes at 37°C or until appropriate 
colour has developed. 

The reaction is stopped using Stop medium (K2HP04/HaH 3 buffer 
comprising EDTA (pH 10)) and read at OD 405/650 using a ELISA 
reader. 

25 Double blinds are included on all ELISA plates. 

Positive and negative sera values are calculated as the 
average blind values added 2 times the standard deviation. This 
gives an accuracy of 95%. 

30 Determination of the molecule weight 

Electrophoretic separation of proteins was performed by standard 
methods using 4-20% gradient SDS poly acrylamide gels (Novex) . 
Proteins were detected by silver staining. The molecule weight was 
measured relative to the mobility of Mark-12® wide range molecule 

3 5 weight standards from Novex. 



Protease activity 
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Analysis with Suc-Ala-Ala-Pro-Phe-pNa: 

Proteases cleave the bond between the peptide and p- 
nitroaniline to give a visible yellow colour absorbing at 405 nm. 

Buffer: e.gr. Britton and Robinson buffer pH 8.3 
5 Substrate: 100 mg suc-AAPF-pNa is dissolved into 1 ml dimethyl 

sulfoxide (DMSO) . 100 ul of this is diluted into 10 ml with 
Britton and Robinson buffer. 

The substrate and protease solution is mixed and the 
absorbance is monitored at 405 nm as a function of time and ABS405 
10 nm/min. The temperature should be controlled (20-50°C depending on 
protease) . This is a measure of the protease activity in the 
sample . 

Proteolytic Activity 

15 In the context of this invention proteolytic activity is 

expressed in Kilo NOVO Protease Units (KNPU) . The activity is 
determined relatively to an enzyme standard (SAVINASE_) , and 
the determination is based on the digestion of a dimethyl 
casein (DMC) solution by the proteolytic enzyme at standard 

20 conditions, i.e. 50°C, pH 8.3, 9 min. reaction time, 3 min. 
measuring time. A folder AF 220/1 is available upon request to 
Novo Nordisk A/S, Denmark, which folder is hereby included by 
reference. 

A GU is a Glycine Unit, defined as the proteolytic enzyme 
25 activity which, under standard conditions, during a 15-minutes' 
incubation at 40°C, with N-acetyl casein as substrate, produces 
an amount of NH2~group equivalent to 1 mmole of glycine. 

Enzyme activity can also be measured using the PNA assay, 
according to reaction with the soluble substrate succinyl- 
30 alanine-alanine-proline-phenyl-alanine-para-nitrophenol, which 
is described in the Journal of American Oil Chemists Society, 
Rothgeb, T.M. , Goodlander, B.D., Garrison, P.H. , and Smith, 
L.A., (1988). 

35 Fermentation of PD498 variants 

Fermentation of PD498 variants in B . subtilis are performed 
at 30°C on a rotary shaking table (300 r.p.m.) in 500 ml baffled 
Erlenmeyer flasks containing 100 ml BPX medium for 5 days. In 
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order to make an e.g. 2 liter broth 20 Erlenmeyer flasks are 
fermented simultaneously. 

Media: 

5 BPX; Composition (per liter) 



Sodium caseinate lOg 

The starch in the medium is liquefied with a-amylase and 
the medium is sterilized by heating at 120°C for 45 minutes. 
After sterilization the pH of the medium is adjusted to 9 by 
15 addition of NaHC0 3 to 0.1 M. 

Purification of PD498 variants 

Approximately 1.6 litres of PD498 variant fermentation 
broth are centrifuged at 5000 rpm for 35 minutes in 1 litre 

2 0 beakers. The supernatants are adjusted to pH 7.0 using 10% 
acetic acid and filtered on Seitz Supra S100 filter plates. 
The filtrates are concentrated to approximately 400 ml using an 
Amicon CH2A UF unit equipped with an Amicon S1Y10 UF cartridge. 
The UF concentrate is centrifuged and filtered prior to 

25 absorption at room temperature on a Bacitracin affinity column 
at pH 7. The PD498 variant is eluted from the Bacitracin column 
at room temperature using 25% 2-propanol and 1 M sodium 
chloride in a buffer solution with 0.01 dime-thyl-glutaric 
acid, 0.1 M boric acid and 0.002 M calcium chloride adjusted to 

30 pH 7. 

The fractions with protease activity from the Bacitracin 
purification step are combined and applied to a 750 ml Sephadex 
G25 column (5 cm diameter) equilibrated with a buffer 
containing 0.01 dimethylglutaric acid, 0.1 M boric acid and 
35 0.002 M calcium chloride adjusted to pH 6.0. 

Fractions with proteolytic activity from the Sephadex G25 
column are combined and applied to a 150 ml CM Sepharose CL 6B 
cat-ion exchange column (5 cm diameter) equilibrated with a 



Potato starch 



lOOg 



10 



Ground barley 
Soybean flour 
Na 2 HP0 4 X 12 H 2 0 
Pluronic 



50g 
20g 
9g 

O.lg 
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buffer containing 0.01 M dimethylglutaric acid, 0.1 M boric 
acid, and 0.002 M calcium chloride adjusted to pH 6.0. 
The protease is eluted using a linear gradient of 0-0.5 M 
sodium chloride in 1 litres of the same buffer. 
5 Protease containing fractions from the CM Sepharose column are 
combined and filtered through a 2\jl filter. 



Balb/C mice IgG ELISA Procedure: 

The antigen is diluted to 1 mg/ml in carbonate buffer. 
100 ml is added to each well. 
The plates are coated overnight at 4°C. 

Unspecific adsorption is blocked by incubating each well for 1 
hour at room temperature with 200 ml blocking buffer. 
The plates are washed 3x with 300 ml washing buffer. 
Unknown mouse sera are diluted in dilution buffer, typically 
lOx, 2 Ox and 4 Ox, or higher. 
100 ml is added to each well. 
Incubation is for 1 hour at room temperature. 
Unbound material is removed by washing 3x with washing buffer. 
The anti-Mouse IgGl antibody is diluted 2000x in dilution 
buffer. 

100 ml is added to each well. 
Incubation is for 1 hour at room temperature. 

Unbound material is removed by washing 3x with washing buffer. 
Streptavidine is diluted 1000X in dilution buffer. 
100 ml is added to each well. 
Incubation is for 1 hour at room temperature. 

Unbound material -is removed by washing 3x with 300 ml washing 
buffer. 

OPD (0.6 mg/ml) and H 2 0 2 (0.4 ml/ml) is dissolved in citrate 
buffer. 

100 ml is added to each well. 

Incubation is for 10 minutes at room temperature. 
The reaction is stopped by adding 100 ml H 2 S0 4 . 
The plates are read at 4 92 nm with 620 nm as reference. 



Immunisation of mice 

Balb/C mice (20 grams) are immunised 10 times (intervals of 14 
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days) by subcutaneous injection of the modified or unmodified 
polypeptide in question, respectively by standard proceedures 
known in art. 



5 EXAMPLES 
Example 1 

Suitable substitutions in PD498 fo r addition of amino 
10 attachment groups f-Ifflb) 

The 3D structure of parent PD498 was modeled as described 
above based on 59% sequence identity with Thermitase® 
(2tec.pdb) . 

The sequence of PD498 is (see SEQ ID NO. 2). PD498 residue 
15 numbering is used, 1-280. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2.bcl below: 



Conservative substitutions: 

20 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 55,0,255 

3 Zone Subset LYS :lys:NZ Static monomer/ residue 10 
Color_Subset 255,255,0 

25 4 Zone Subset NTERM :1:N Static monomer /residue 10 
Color_Subset 255,255,0 

5 #N0TE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 39, 72, 226 Static monomer/residue 8 
30 Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #N0TE: editnextline object name according to the protein 

10 Combine Subse.t REST Difference PD4 9 8 FINALMODEL ALLZONE 
35 11 List Subset REST Atom Output_File restatom. list 

12 List Subset REST monomer/residue Output_File restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 

14 List Subset ACTSITE Atom Output^File acts iteatom. list 

15 List Subset ACTSITE monomer /residue Output_File 
40 actsitemole. list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
Color_Subset 

18 Combine Subset SUB5A Difference REST5A ACTSITE 
45 19 Combine Subset SUB5B Difference SUB 5 A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

21 List Subset SUB5B Atom Output_File sub5batom. list 

22 List Subset SUB5B monomer/residue Output_File sub5bmole. list 
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23 #Now identify sites for lys->arg substitutions and continue 
with makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list, 
subSbatom. list & accsiteatom. list 

5 

Comments : 

Lines 1-8: The subset ALLZONE is defined as those residues 
which are either within 10 A of the free amino groups on 
lysines or the N-terminal, or within 8 A of the catalytic triad 
10 residues 39, 72 and 226. 

Line 10: The subset REST is defined as those residues not 
included in ALLZONE. 

Lines 17-20: Subset SUB5B is defined as those residues in a 
5 A shell around REST, excluding residues within 8 A of the 
15 catalytic residues. 

Line 23-24: REST contains Arg62 and Argl69, SUB5B contains 
Arg51, Argl21, and Arg250. ACTSITE contains Argl03, but 
position 103 is within 8 A from essential_catalytic_residues, 
and thus not relevant. 
20 The colour codes are: (255,0,255) = magenta, 

(255,255,0)yellow, (255,0,0) red, and (255, 255, 255)= white. 

The substitutions R51K, R62K, R121K, R169K and R250K are 
identified in parent PD4 98 as suitable sites for mutagenesis. 
The residues are substituted below in section 2, and further 
25 analysis done: 



Non-conservative substitutions: 
makeKzone2 . bcl 

I #sourcefile makezone2 .bcl Claus von der Osten 961128 
30 2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #NOTE: editnextline object name according to protein 

5 Copy Object -To_Clipboard -Displace PD498FINALMODEL 
3 5 newmodel 

6 Biopolymer 

7 #NOTE: editnextline object name according to protein 

8 Blank Object On PD498FINALMODEL 

9 #NOTE: editnextlines with lys->arg positions 
40 10 Replace Residue newmodel: 51 lys L 

II Replace Residue newmodel: 62 lys L 

12 Replace Residue newmodel: 121 lys L 

13 Replace Residue newmodel: 169 lys L 

14 Replace Residue newmodel: 250 lys L 
45 15 # 
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16 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

17 Color Molecule Atoms newmodel Specified Specification 
255,0, 255 

5 18 Zone Subset LYSx newmodel: lys:NZ Static monomer/ residue 10 
Color_Subset 255,255,0 

19 Zone Subset NTERMx newmodel: 1:N Static monomer /residue 10 
Color_Subset 255,255,0 

20 #N0TE: editnextline ACTSITEx residues according to the 
10 protein 

21 Zone Subset ACTSITEx newmodel: 39 , 72 , 226 Static 
monomer/residue 8 Color_Subset 255,255,0 

22 Combine Subset ALLZONEx Union LYSx NTERMx 

23 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 24 Combine Subset RESTx Difference newmodel ALLZONEx 

25 List Subset RESTx Atom Output File restxatom. list 

26 List Subset RESTx monomer/ resxdue Output_File 
restxmole. list 

27 # 

20 28 Color Molecule Atoms ACTSITEx Specified Specification 
255,0,0 

29 List Subset ACTSITEx Atom Output_File actsitexatom. list 

30 List Subset ACTSITEx monomer /residue Output_File 
actsitexmole. list 

25 31 # 

32 #read restxatom. list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

Comments : 

30 Lines 1-15: Solvent exposed arginines in subsets REST and 

SUB5B are replaced by lysines. Solvent accessibilities are 

recalculated following arginine replacement. 

Lines 16-23: The subset ALLZONEx is defined as those 

residues which are either within 10 A of the free amino groups 
3 5 on Lysines (after replacement) or the N- terminal, or within 8 A 

of the catalytic triad residues 39, 72 and 226. 

Line 24-26: The subset RESTx is defined as those residues 

not included in ALLZONEx, i.e. residues which are still 

potential epitope contributors. Of the residues in RESTx, the 
40 following are >5% exposed (see lists below): 6-7,9-12,43- 

45,65,87-88,209,211,216-221,262. 

The following mutations are proposed in parent PD498: P6K, 

Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, G87K, I88K, 

N209K, A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K. 
45 Relevant data for Example 1: 

Solvent accessibility data for PD498MODEL: 

# PD498MODEL Fri Nov 29 10:24:48 MET 1996 

# residue area 
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SER_229 0.000000 

PRO_2 3 0 0.2 68693 

HIS_231 2.680759 

VAL_232 0.000000 

5 ALA_233 0.000000 

GLY_23 4 1.074774 

LEU_235 11.500556 

ALA_236 0.000000 

ALA_237 0.000000 

10 LEU_238 1.612160 

LEU_239 0.000000 

ALA_240 10.648088 

SER_241 39.138004 

GLN_242 71.056175 

15 GLY_243 66.487144 

LYS_244 43.256012 

ASN_245 80.728127 

ASN_246 34.859673 

VAL_247 84.145645 

20 GLN_248 51.819775 

ILE_249 8.598188 

ARG_250 35.055809 

GLN_251 71.928093 

ALA_252 0.000000 

25 ILE_253 4.845899 

GLU_254 13.344438 

GLN_255 81.705254 

THR_256 9.836061 

ALA_257 2.810513 

30 ASP_258 44.656136 

LYS_259 113.071686 

ILE_260 32.089527 

SER_261 91.590103 

GLY_262 26.450439 

35 THR_263 38.308762 

GLY_264 46.870056 

THR_2 65 88.551804 

ASN_266 34.698349 

PHE_267 7.756911 

40 LYS_268 103.212852 

TYR_269 37.638382 

- GLY_270 0.000000 
LYS_271 11.376978 
ILE_272 2.885231 

45 ASN_273 19.195255 

SER_274 2.651736 

ASN_275 38.177547 

LYS_276 84.549576 

ALA_277 1.074774 

50 VAL_278 4.775503 

ARG_279 162.693054 

TYR_280 96.572929 

CA_281 0.000000 

CA_282 0.000000 

55 CA_283 8.803203 

Subset REST: 
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restraole. list 
Subset REST: 

PD4 9 8 FINALMODEL : 6-7 , 9-12 , 43-46 , 61-63 , 65, 87- 
89, 111-114, 117-118, 131, 
5 PD4 9 8 FINALMODEL: 137-139 , 158-159 , 169-171 , 173- 
174,180-181,209,211, 

PD4 9 8 FINALMODEL: 216-221, 232-233 , 262 ,E282H 

resta torn. list 
Subset REST: 
10 PD49 8FINALMODEL : PRO 6 : N, CA, CD, C, O, CB, CG 

PD4 9 8 FINALMODEL : TYR 7 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
PD498 FINALMODEL : S ER 9 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL: ALA 10 :N, CA, C,0, CB 

PD4 9 8 FINALMODEL: TYR 11 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
15 PD4 9 8 FINALMODEL :GLN 12 :N, CA, C, O, CB, CG, CD, OE1 ,NE2 

PD498 FINALMODEL : ASP 4 3 : N , CA , C , O , CB , CG , OD 1 , OD2 
PD4 9 8 F IN ALMODEL : TYR 

44:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL : ASN 4 5 : N , CA , C , O , CB , CG , OD1 , ND2 
2 0 PD4 9 8 FINALMODEL: HIS 

46:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL : ASP 6 1 : N , CA , C , O , CB , CG , OD1 , OD2 
PD498 FINALMODEL : ARG 
62:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
25 PD4 9 8 FINALMODEL: ASP 63 : N, CA, C, O, CB, CG, ODl , OD2 

PD4 9 8 FINALMODEL : ASN 65 : N , CA , C , O , CB , CG , ODl , ND2 
PD498FINALMODEL: GLY 87:N,CA,C,0 

PD4 9 8 FINALMODEL : ILE 8 8 : N , CA , C , O , CB , CGI , CG2 , CD 1 

PD4 9 8FINALMODEL : GLY 89:N,CA,C,0 
30 PD4 9 8 FINALMODEL: GLY 111:N,CA,C,0 

PD498FINALMODEL: SER 112 : N, CA, C, O, CB , OG 

PD4 9 8 FINALMODEL: LEU 113 : N , CA, C , O , CB , CG , CD1 , CD2 

PD4 9 8 FINALMODEL : ASP 114 : N, CA, C,0 , CB , CG , ODl , OD2 

PD4 9 8 FINALMODEL: ALA 117 : N, CA, C,0, CB 
35 PD4 9 8 FINALMODEL: SER 118 :N, CA, C, O, CB , OG 

PD4 9 8 FINALMODEL : LEU 13 1 : N , CA , C , O , CB , CG , CD1 , CD 2 

PD4 9 8 FINALMODEL: CYS 137:N,CA,C,0,CB,SG 

PD498 FINALMODEL : GLU 
138:N,CA,C,0,CB,CG,CD,OEl,OE2 
40 PD4 9 8 FINALMODEL: CYS 139 :N , CA, C,0, CB, SG 

PD4 9 8 FINALMODEL :VAL 158 : N , CA, C,0, CB , CGI , CG2 

PD4 9 8FINALMODEL : VAL 159 :N , CA, C ,0, CB, CGI , CG2 

PD4 9 8FINALMODEL : ARG 
169:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
45 PD4 9 8 FINALMODEL :THR 170:N,CA,C,0,CB,OG1,CG2 

PD4 9 8 FINALMODEL : PHE 
171:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD498 FINALMODEL : PRO 1 7 3 : N , CA , CD , C , O , CB , CG 

PD4 9 8 FINALMODEL: ALA 174 :N,CA,C,0,CB 
50 PD4 9 8FINALMODEL : ILE 180 :N, CA, C,0, CB, CGI , CG2 , CD1 

PD4 9 8 FINALMODEL: ALA 181:N,CA,C,0,CB 

PD4 9 8 FINALMODEL : ASN 209 :N, CA, C, O , CB , CG, ODl , ND2 

PD4 9 8 FINALMODEL : ALA 2 1 1 : N , CA , C , O , CB 

PD4 9 8 FINALMODEL: ASN 2 1 6 : N , CA , C , O , CB , CG , ODl , ND2 
55 PD4 9 8 FINALMODEL: ASN 217 : N, CA, C , O, CB , CG, ODl , ND2 

PD4 9 8 FINALMODEL: GLY 218:N,CA,C f O 
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PD4 98FINALMODEL: TYR 

219:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
P04 9 8 FINALMODEL : SER 220: N, CA, C, O, CB,OG 
PD4 9 8 FINALMODEL : TYR 
5 221:N,CA,C,0, CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD498 FINALMODEL : VAL 2 3 2 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 FINALMODEL: ALA 233 :N, CA, C, O, CB 
PD4 98FINALMODEL : GLY 262:N,CA,C / 0 
PD4 9 8 FINALMODEL :CA E282H:CA 

10 

Subset SUB5B: 

sub5bmole. list 
Subset SUB5B: 

PD4 9 8 FINALMODEL: 4-5, 8 ,13 -16, 34 -3 5, 47- 
15 51,53,64,83,85-86,90-91,120-124, 

PD4 9 8 FINALMODEL: 128-130, 140-141, 143-144 , 147- 
148 , 151-152 , 156-157 , 

PD4 9 8 FINALMODEL: 165 , 167-168 , 172 , 175-17 6, 178- 
179, 196,200-205,208, 
20 PD4 9 8 FINALMODEL: 234-237, 250, 253-254, 260-261, 2 63- 

267,272,E281H, 

PD498 FINALMODEL : E 2 8 3 H 

sub5batom. list 

25 Subset SUB5B: 

PD4 9 8 FINALMODEL : ASN 4 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : ASP 5 : N , CA , C , O , CB , CG , OD1 , OD2 
PD498 FINALMODEL : TYR 
8:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

30 PD4 9 8 FINALMODEL: TYR 

13 :N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL: GLY 14:N,CA,C,0 
PD4 9 8 FINALMODEL : PRO 15 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL : GLN 1 6 : N , CA , C , O , CB , CG , CD , OE1 , NE2 

35 PD4 9 8 FINALMODEL :THR 34 :N, CA, C, O, CB, OG1 , CG2 

PD498 FINALMODEL : VAL 3 5 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : PRO 4 7 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL: ASP 48 :N, CA, C, O, CB, CG, OD1 , OD2 
PD49 8 FINALMODEL : LEU 4 9 : N , CA , C , O , CB , CG , CD1 , CD2 

40 PD4 9 8 FINALMODEL: ALA 50 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL : ARG 

51:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL: VAL 53 :N, CA, C, O, CB , CGI , CG2 
PD4 9 8 FINALMODEL: ASN 64 :N, CA, C, O, CB, CG, OD1 , ND2 

45 PD4 9 8 FINALMODEL: ASP 83 :N, CA, C, 0, CB , CG, OD1 , OD2 

PD4 9 8 FINALMODEL: ASN 85:N,CA,C,0,CB,CG,ODl,ND2 
PD4 9 8FINALMODEL : ASN 86:N,CA,C,0,CB,CG,ODl,ND2 
PD4 9 8FINALMODEL : VAL 90:N,CA,C,0,CB,CG1,CG2 
PD4 9 8 FINALMODEL: ALA 91:N,CA,C,0,CB 

50 PD4 9 8 FINALMODEL :ILE 120 :N, CA, C,0, CB , CGI, CG2 , CD1 

PD498 FINALMODEL : ARG 

121:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL : TYR 
122:N,CA,C,O f CB,CG,CDl,CD2,CEl,CE2,CZ,OH 

55 PD4 9 8 FINALMODEL: ALA 123 : N, CA, C,0, CB 

PD498 FINALMODEL : ALA 1 2 4 : N , CA , C , O , CB 
PD498 FINALMODEL : ALA 1 2 8 : N , CA , C , O , CB 
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PD498FINALMODEL: LYS 129 :N , CA, C, O, CB, CG, CD , CE, NZ 

PD4 98FINALMODEL : VAL 13 0 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ASN 140:N,CA,C,0,CB,CG,OD1, ND2 

PD4 98FINALMODEL : SER 14 1 : N , CA, C , O , CB , OG 

5 PD 4 9 8 F INALMODEL : THR 143 :N, CA, C , O, CB , OG1 , CG2 

PD498FINALMODEL : LEU 144 : N , CA, C , O , CB , CG , CD1 , CD 2 

PD4 9 8 FINALMODEL : ALA 147:N,CA,C,0,CB 

PD498FINALMODEL : VAL 148 : N , CA, C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ALA 151 :N, CA, C, O, CB 
10 PD498FINALMODEL: TRP 

52:N,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3, 

CZ2,CZ3,CH2 
PD4 9 8 FINALMODEL: ALA 156 :N, CA, C , O, CB 
PD4 9 8 FINALMODEL : VAL 157 : N , CA , C , O , CB , CGI , CG2 
15 PD4 9 8 FINALMODEL: ASP 165:N,CA,C,0,CB,CG,ODl,OD2 

PD4 9 8 FINALMODEL : VAL 167:N,CA,C / 0,CB,CG1,CG2 
PD4 9 8 FINALMODEL : SER 168 : N , CA, C , O , CB , OG 
PD4 9 8 FINALMODEL : GLN 

172:N,CA,C,0,CB,CG,CD,0E1,NE2 
20 PD4 9 8 FINALMODEL: SER 175 :N , CA, C , O, CB , OG 

PD498 FINALMODEL : TYR 

176:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL : ASN 178 : N , CA , C , O , CB , CG , OD1 , ND2 
PD498 FINALMODEL : ALA 179 : N , CA , C , O , CB 
25 PD4 9 8 FINALMODEL: ASN 196 : N, CA, C , O, CB , CG ,ODl , ND2 

PD4 98 FINALMODEL : TRP 

200:N,CA,C,O,CB,CG,CDl,CD2,NEl,CE2 / CE3, 

CZ2,CZ3,CH2 
PD4 9 8 FINALMODEL: VAL 201 :N , CA, C, O, CB , CGI , CG2 

30 PD4 9 8 FINALMODEL: ASP 202 :N , CA, C, O, CB , CG, OD1 , OD2 

PD4 9 8 FINALMODEL: VAL 2 03 : N , CA , C , 0 , CB , CGI , CG2 
PD4 9 8 FINALMODEL : THR 2 04 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL : ALA 2 05 : N , CA , C , O , CB 
PD4 9 8 FINALMODEL: VAL 208 :N, CA, C, O, CB , CGI , CG2 

35 PD4 9 8 FINALMODEL :GLY 234:N,CA,C,0 

PD4 9 8 FINALMODEL: LEU 235 :N, CA, C, O, CB , CG, CD1 , CD2 
PD4 9 8 FINALMODEL: ALA 236 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ALA 2 3 7 : N , CA , C , O , CB 
PD4 9 8 FINALMODEL : ARG 

40 250:N,CA,C,0 / CB,CG,CD,NE,CZ,NH1,NH2 

PD4 9 8 FINALMODEL : ILE 253 : N , CA , C , O , CB , CGI , CG2 , CD1 
PD4 9 8 FINALMODEL : GLU 

254:N,CA / C,0,CB,CG,CD,OEl,OE2 
PD4 9 8 FINALMODEL : ILE 2 6 0 : N , CA , C , O , CB , CGI , CG2 , CD1 

45 PD4 9 8 FINALMODEL: SER 261 :N, CA, C, O, CB, OG 

PD498 FINALMODEL : THR 2 6 3 : N , CA , C , O , CB , OG 1 , CG2 
PD4 9 8 FINALMODEL : GLY 264:N,CA,C,0 
PD498 FINALMODEL : THR 2 65 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL : ASN 2 6 6 : N , CA , C , O , CB , CG , OD1 , ND2 

50 PD4 9 8 FINALMODEL :PHE 

267:N,CA,C,0,CB, CG,CD1,CD2,CE1,CE2,CZ 
PD4 9 8 FINALMODEL : ILE 2 7 2 : N , CA , C , O , CB , CGI , CG2 , CD1 
PD4 9 8 FINALMODEL :CA E281H:CA 
PD4 9 8FINALMODEL : CA E283H:NA 

55 

Subset ACTSITE: 

actsitemole. list 



- WO 98/35026 



52 



PC17DK98/00046 



Subset ACTSITE: 

PD498FINALMODEL: 36-42 , 57-60, 66-80 , 100-110 , 115- 
116,119,132-136,160-164, 

PD498FINALMODEL: 182-184 , 194 , 206-207 , 210, 212- 
5 215,222-231 



36:N,CA 
37:N, 
38:N, 
39:N, 
40:N, 
41:N,w» 
42:N,CA 



, CG , CD1 
58:N,CA 

, CG , CD1 
60:N,CA 
66:N,CA 
67:N,CA 
68:N,CA 
69:N,CA 
70:N,CA 
71:N,CA 

,CG,ND1 
73:N,CA 
74:N,CA 



acts i t eat om. list 
Subset ACTSITE: 

PD4 9 8FINALMODEL : ALA 

10 PD4 9 8 FINALMODEL : VAL 

PD498 FINALMODEL : LEU 
PD4 9 8 FINALMODEL : ASP 
PD 4 9 8 FINALMODEL : SER 
PD498FINALMODEL: GLY 

15 PD4 9 8 FINALMODEL : VAL 

PD498 FINALMODEL : TYR 
57:N f CA,C,0,CB 
PD49 8 FINALMODEL : ASP 
PD 4 9 8 F INALMODEL : PHE 

20 59:N,CA,C,0,CB 
PD4 9 8 FINALMODEL: ILE 
PD4 9 8 FINALMODEL : PRO 
PD4 9 8 FINALMODEL : MET 
PD4 9 8 FINALMODEL : ASP 

25 PD498FINALMODEL: LEU 

PD4 9 8 FINALMODEL : ASN 
PD498 FINALMODEL : GLY 
PD4 98FINALMODEL:HIS 
72:N,CA,C,0,CB 

30 PD4 9 8FINALMODEL : GLY 

PD4 9 8 FINALMODEL : THR 
PD4 9 8FINALMODEL : HIS 

75:N,CA / C,0,CB 
PD4 9 8FINALM0DEL : VAL 

35 PD4 9 8FINALMODEL : ALA 

PD4 9 8 FINALMODEL : GLY 
PD4 9 8FINALMODEL : THR 
PD49 8 FINALMODEL : VAL 
PD498FINALMODEL: LEU 

40 PD4 9 8 FINALMODEL : ALA 

PD4 9 8FINALMODEL : VAL 
PD4 9 8 FINALMODEL : ARG 

103:N,CA,C,O,CB,CG,CD,NE, 
PD4 9 8 FINALMODEL: VAL 104 :N, 

45 PD4 9 8 FINALMODEL: LEU 105 :N, 

PD4 9 8 FINALMODEL: ASP 106 :N, 
PD498FINALMODEL: ALA 107 :N, 
PD4 9 8 FINALMODEL: ASN 108 :N, 
PD4 9 8 FINALMODEL: GLY 109 :N, 

50 PD498FINALMODEL: SER 110 :N, 

PD4 9 8 FINALMODEL: SER 115 :N, 
PD4 9 8 FINALMODEL: ILE 116 :N, 
PD4 9 8 FINALMODEL: GLY 119 :N, 
PD4 9 8 FINALMODEL: ASN 132 :N f 

55 PD4 98 FINALMODEL: LEU 133 :N, 

PD498FINALMODEL:SER 134 :N, 
PD4 9 8 FINALMODEL: LEU 135 :N, 



CA 
CA 
CA 
CA 
CA 



C,0,CB 

C,0,CB,CG1,CG2 
C,0,CB,CG,CD1,CD2 

, C,0,CB,CG,OD1,OD2 

, C,0,CB,OG 

rC,0 

,C,0,CB,CG1,CG2 

CD2,CE1,CE2,CZ,0H 
C,0,CB,CG,ODl,OD2 

CD2,CE1,CE2,CZ 
,C,0,CB,CG1,CG2,CD1 
,CD,C,O f CB,CG 
,C,0,CB,CG,SD,CE 
, C,0,CB,CG,ODl,OD2 
,C,0,CB,CG,CD1,CD2 
,C,0,CB,CG,0D1,ND2 
,C,0 

CD2,CE1,NE2 
C,0 

C,0,CB,0G1,CG2 



, CG , ND1 , CD2 , CE1 , NE2 
76:N,CA,C,0,CB,CG1,CG2 
77:N,CA,C,0,CB 
78:N,CA,C,0 

79:N,CA,C,0,CB,0G1,CG2 
80:N,CA,C,0,CB,CG1,CG2 
100 :N,CA, 0,0,06,00,001,002 
101:N,CA,C,O,CB 
102:N,CA,C,O,CB,CGl,CG2 



CZ,NH1 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C f O 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 
CA,C,0 



,NH2 
?,CB,CG1,CG2 
),CB,CG,CD1,CD2 
D,CB,CG,ODl,OD2 
},CB 
,CB,CG,ODl,ND2 

i 

, CB , OG 
,CB,OG 

,06,001,062,001 

> 

),CB,CG,ODl,ND2 
>,CB,CG,CD1,CD2 
>, CB,OG 

,CB,CG,CD1,CD2 
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PD498FINALMODEL:GLY 136:N,CA,C,0 
PD49 8 FINALMODEL : ALA 1 6 0 : N , CA , C , O , CB 
PD4 98 FINALMODEL : ALA 1 6 1 : N , CA , C , O , CB 
PD4 9 8 FINALMODEL : ALA 162:N,CA,C,0,CB 
5 PD4 9 8 FINALMODEL: GLY 163:N,CA,C,0 

PD498 FINALMODEL : ASN 1 6 4 : N , CA , C , O , CB , CG , OD1 , ND2 
PD4 9 8 FINALMODEL : VAL 18 2 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : GLY 183:N,CA,C,0 
PD4 9 8 FINALMODEL: ALA 184 :N,CA,C,0, CB 
10 PD498FINALMODEL: PHE 

194 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
PD4 9 8 FINALMODEL: PRO 206 :N, CA, CD, C, O, CB, CG 
PD4 9 8 FINALMODEL: GLY 207:N,CA,C,O 

PD4 9 8 FINALMODEL : ILE 2 10 : N , CA , C, O , CB , CGI , CG2 , CD1 
15 PD4 9 8 FINALMODEL: SER 212 :N, CA, C, O, CB, OG 

PD498 FINALMODEL : THR 2 13 : N , CA , C , O , CB , OG 1 , CG2 
PD498 FINALMODEL : VAL 2 14 : N , CA , C , O , CB , CG 1 , CG2 
PD4 9 8 FINALMODEL: PRO 215:N,CA,CD,C,0,CB,CG 
PD4 9 8 FINALMODEL: MET 222 :N, CA, C, O, CB, CG , SD, CE 
20 PD4 9 8 FINALMODEL: SER 223 :N,CA,C,0, CB,OG 

PD4 9 8 FINALMODEL: GLY 224:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 2 2 5 : N , CA , C , O , CB , OG1 , CG2 
PD4 9 8 FINALMODEL: SER 226:N,CA,C,0,CB,OG 
PD4 9 8 FINALMODEL: MET 227 :N, CA, C, O, CB, CG, SD, CE 
25 PD4 9 8 FINALMODEL : ALA 228 :N, CA, C, O, CB 

PD 4 9 8 FINALMODEL : SER 2 2 9 : N , CA , C , O , CB , OG 
PD4 9 8 FINALMODEL : PRO 23 0 : N , CA, CD , C , O , CB , CG 
PD498 FINALMODEL : HIS 
231:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 

Subset RESTx: 

restxmole . list 
Subset RESTX: 

NEWMODEL: 6-7 , 9-12 ,43-46 , 65 , 87- 
35 89,131,173,2 09,211,216-221,232-233, 
NEWMODEL : 2 62 , E2 8 2H 

restxatom. list 
Subset RESTX: 
40 NEWMODEL : PRO 6:N,CA,CD,C,0,CB,CG 

NEWMODEL : TYR 
7:N,CA,C / 0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
NEWMODEL : SER 9:N,CA,C,0,CB,OG 
NEWMODEL: ALA 10:N,CA,C,0,CB 
45 NEWMODEL: TYR 

ll:N,CA,C,O,CB,CG,CDl f CD2,CEl,CE2,CZ,0H 

NEWMODEL: GLN 12 : N, CA, C, O, CB , CG, CD ,OEl , NE2 
NEWMODEL : ASP 43:N,CA,C,0,CB,CG,ODl,OD2 
NEWMODEL: TYR 
50 44:N,CA,C,0,CB,CG,CD1,CD2,CE1 / CE2,CZ,0H 

NEWMODEL: ASN 45 : N, CA, C, O, CB , CG, OD1 , ND2 
NEWMODEL : HIS 46 :N, CA, C,0, CB, CG,ND1, CD2 , CE1,NE2 
NEWMODEL: ASN 65:N,CA,C,0,CB,CG,0D1,ND2 
NEWMODEL : GLY 87:N,CA,C,0 
55 NEWMODEL : ILE 88 :N, CA, C, O, CB , CGI , CG2 , CD1 

NEWMODEL : GLY 89:N,CA,C,0 

NEWMODEL : LEU 131 : N, CA, C, O , CB, CG , CD1 , CD2 
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NEWMODEL : PRO 173:N,CA,CD,C,0,CB,CG 

NEWMODEL.'ASN 209:N,CA,C,O,CB,CG,ODl,ND2 

NEWMODEL: ALA 211:N,CA,C,0,CB 

NEWMODEL : ASN 2 1 6 : N , CA , C , O , CB , CG , ODl , ND2 
5 NEWMODEL.'ASN 217 :N, CA, C, 0, CB, CG, ODl ,ND2 

NEWMODEL : GLY 218:N,CA,C,0 

NEWMODEL :TYR 
2 1 9 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

NEWMODEL : SER 220:N, CA,C,0,CB,OG 
10 NEWMODEL : TYR 

221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

NEWMODEL: VAL 232 :N, CA, C, O, CB, CGI , CG2 

NEWMODEL: ALA 233 :N, CA, C, O, CB 

NEWMODEL : GLY 262:N,CA,C,0 
15 NEWMODEL :CA E282H:CA 

Example 2 

Suitable substitutions in Savinase® for addition of amino 
20 attachment groups (-NH ?) 

The known X-ray structure of Savinase® was used to find 
where suitable amino attachment groups may is added (Betzel et 
al, (1992), J. Mol. Biol. 223, p. 427-445). 

The 3D structure of Savinase® is available in the Brookhaven 
25 Databank as lsvn.pbd. A related subtilisin is available as 
lst3.pdb. 

The sequence of Savinase® is shown in SEQ ID NO. 3 
The sequence numbering used is that of subtilisin BPN ' , 
Savinase® having deletions relative to BPN 1 at positions: 36, 
30 56, 158-159 and 163-164. The active site residues (functional 
site) are D32,H64 and S221. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2.bcl below: 

35 Conservative substitutions: 
makeKzone . bcl 
Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
Zone Subset LYS :lys:NZ Static monomer /residue 10 Color_Subset 
40 255,255,0 

Zone Subset NTERM :el:N Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE : e32 , e64 , e221 Static monomer/residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union LYS NTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 

#NOTE: editnextline object name according to the protein 
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Combine Subset REST Difference SAVI8 ALLZONE 
List Subset REST Atom Output File rest atom. list 
List Subset REST monomer/ resTdue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 List Subset ACTSITE Atom Output File actsiteatom. list 
List Subset ACTSITE monomer/ resTdue Output_File 
actsitemole . list 
# 

Zone Subset REST5A REST Static Monomer /Residue 5 -Color_Subset 
10 Combine Subset SUB 5 A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File subsbatom. list 
List Subset SUB5B monomer /residue Output_File sub5bmole. list 
15 #Now identify sites for lys->arg substitutions and continue 
with makezone2 .bcl 

#Use grep command to identify ARG in restatom. list , 
sub5batom. list & accsiteatom. list 

20 Comments: 

In this case of Savinase® REST contains the Arginines ArglO, 
Argl70 and Arg 186, and SUB5B contains Argl9, Arg45, Argl45 and 
Arg247 . 

These residues are all solvent exposed. The substitutions 
25 R10K, R19K, R45K, R145K, R170K, R186K and R247K are identified 
in Savinase® as sites for mutagenesis within the scope of this 
invention. The residues are substituted below in section 2, 
and further analysis done. The subset ACTSITE contains Lys94. 
The substitution K94R is a mutation removing Lysine as 
30 attachment group close to the active site. 



Non-conservative substitutions : 
makeKzone2 .bcl 

#sourcefile makezone2 . bcl Claus von der Osten 961128 
35 # 

/having scanned lists (grep arg command) and identified sites 
for lys->arg substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace SAVI8 newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On SAVI8 

#NOTE: editnextlines with lys->arg positions 

Replace Residue newmodel :el0 lys L 
45 Replace Residue newmodel: el 70 lys L 

Replace Residue newmodel : el86 lys L 

Replace Residue newmodel: el 9 lys L 

Replace Residue newmodel:e45 lys L 

Replace Residue newmodel :el4 5 lys L 
50 Replace Residue newmodel :e2 41 lys L 
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# 

#Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
5 Zone Subset LYSx newmodel : lys : NZ Static monomer/residue 10 
Color_Subset 255 , 255, 0 

Zone Subset NTERMx newmodel : el: N Static monomer/residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel :e3 2, e64 ,e221 Static 

monomer/ residue 8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union LYSx NTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

Combine Subset RESTx Difference newmodel ALLZONEx 
15 List Subset RESTx Atom Output File restxatom. list 

List Subset RESTx monomer /residue Output_File restxmole . list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
List Subset ACTSITEx Atom Output File actsitexatom. list 
20 List Subset ACTSITEx monomer /residue Output_File 
actsitexmole. list 
# 

#read restxatom. list or restxmole. list to identify sites for 
(not_arg) ->lys subst. if needed 

25 

Comments : 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 5,14,22,38-40,42,75-76,82,86,103-105,108,133- 
135,137,140,173,204,206,211-213,215-216,269. The following 
30 mutations are proposed in Savinase®: P5K, P14K, T22K, T38K, 

H39K, P40K, L42K, L75K, N76K, L82K, P86K, S103K, V104K, S105K, 
A108K, A133K, T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, 
G211K, S212K, T213K, A215K, S216K, N269K. 
Relevant data for Example 2: 

35 Solvent accessibility data for SAVINASE® : 

# SAVI8NOH20 Fri Nov 29 13:32:07 MET 1996 

# residue area 





ALA 


1 


118.362808 




GLN~ 


"2 


49.422764 


40 


SER~ 


"3 


61.982887 




VAL~ 


"4 


71.620255 




PRO" 


"5 


21.737535 




TRP" 


"6 


58.718731 




GLY~ 


"7 


4.328117 


45 


ile" 


"8 


6.664074 




ser" 


"9 


60.175900 




arg" 


"10 


70.928963 




val" 


"ll 


2.686934 




gln" 


"12 


72.839996 


50 


ALA" 


"13 


0.000000 




PRO 


"14 


52.308453 




ALA 


"15 


38.300892 




ALA" 


"16 


0. 000000 
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HIS_ 


17 


41 . 826324 




ASN_ 


18 


136 . 376602 




ARG~ 


19 


105 . 678642 




GLY~ 


20 


48 . 231510 


5 


leu" 


[21 


17 . 196377 




THR~ 


22 


36 . 781742 




GLY_ 


23 


0 . 000000 




SER_ 


24 


64 • 151276 




GIjY_ 


~1 r- 


ou • 2o9905 


1U 


VALi 


£b 


A f\1 f\ A A 1 

4 • O 3U4U1 




T If p 

LYS_ 


27 


54 • 239555 




VAJj_ 


28 


U • UUUUUU 




ALA_ 


29 


o . oooooo 




VAL_ 


30 


3 . 572827 


15 


LEU_ 


31 


0 .233495 




ASP_ 


32 


1.074774 




THR~ 


33 


1.973557 




GLY~ 


34 


3 . 638052 




ILE_ 


35 


8 . 044439 


20 


SER_ 


36 


8 . 514903 




THR 


37 


122 . 598907 




HIS 


38 


18 . 834011 




PRO 


39 


76 . 570526 




ASP_ 


40 


0 . oooooo 


25 


LEU_ 


41 


19 . 684013 




ASN 


42 


88 . 870216 




ILE_ 


43 


56 . 117710 




ARG_ 


44 


110 . 647194 




GLY_ 


45 


26 .935413 


30 


GLY_ 


46 


35 . 515778 




ALA 


47 


21 . 495472 




SER_ 


48 


34 . 876190 




PHE_ 


49 


52 . 647541 




VAL_ 


50 


23 . 364208 


35 


PRO_ 


51 


110 .408752 




GLY_ 


52 


80 . 282906 




GLU_ 


53 


43 . 033707 




PRO 


54 


1 1 i( A A /I O "5 

124 • 44433 6 






3D 


60 • 28488^ 


40 


THR 


56 


47 . 103241 




GLiN_ 


57 


I*"*/"* O AO C AC 

120 • 803505 




ASP_ 


™c o 
DO 


12 .784743 




/™<T* XT 

GLY_ 


59 


61 . 742443 




ASN_ 


60 


56 . 760231 


45 


GLY 


61 


1 . 576962 




HIS_ 


62 


38 . 590118 




GLY_ 


63 


0. oooooo 




THR^ 


64 


0 . 537387 




UTc" 

nio 






50 


VAL" 


"66 


1.612160 




ALA" 


"67 


0.000000 




GLY" 


"68 


2.801945 




THR" 


"69 


9.074596 




ILE" 


"70 


0.000000 


55 


ALA" 


"71 


4.577205 




ALA" 


"72 


0.000000 




LEU" 


"73 


47.290039 
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58 





ASN 


74 


102.187248 




asn" 


"75 


60.210400 




ser" 


"76 


84.614494 




ILE 77 


66.098572 


5 


GLY 


78 


17.979534 




VAL" 


"79 


5.642561 




LEU" 


"80 


13 .025185 




GLY" 


"81 


0.000000 




VAL" 


82 


0.268693 


10 


ALA" 


"83 


0.000000 




PRO" 


"84 


18. 193810 




SER~ 


"85 


56.839039 




ALA" 


"86 


13.075745 




GLU" 


"87 


37.011765 


15 


LEU 88 


2.149547 




TYR 


89 


30.633518 




ALA" 


"90 


1.343467 




VAL~ 


"91 


0.779450 




LYS" 


"92 


5.862781 


20 


VAL" 


"93 


0.466991 




LEU" 


"94 


10.747736 




GLY" 


"95 


8.707102 




ALA" 


"96 


41.414677 




SER~ 


"97 


96.066040 


25 


GLY" 


"98 


33.374485 




SER 


"99 


67. 664116 




GLY" 


"100 


35.571117 




SER~ 


"101 


54.096992 




VAL 


"102 


52.695324 


30 


SER" 


"103 


62.929684 




ser" 


"104 


8. 683097 




ILE" 


"105 


15.852910 




ALA 


"106 


14.509443 




GLN" 


JL07 


94.463066 


35 


GLY 


108 


0.000000 




LEU 


"109 


0.537387 




GLU" 


"no 


63.227707 




TRP" 


"ill 


55.500740 




ALA" 


"112 


0.502189 


40 


GLY" 


"113 


11.908267 




asn" 


"114 


107.208527 




asn" 


"115 


78.811234 




gly" 


"116 


41.453194 




met" 


"117 


9.634291 


45 


his" 


"118 


54.022118 




VAL" 


"119 


5.105174 




ALA 


"120 


0.268693 




ASN 


"121 


0.233495 




LEU~ 


"122 


0.537387 


50 


SER" 


"123 


4 . 004620 




LEU" 


"124 


21.927265 




gly" 


"125 


55.952454 




SER~ 


"126 


40.241180 




pro" 


"127 


107.409439 


55 


ser" 


"128 


57.988609 




pro" 


'129 


85.021118 




SER~ 


"130 


20.460915 
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59 





ALA 


131 


57.404362 




thr" 


"132 


74.438805 




leu" 


*133 


12.091203 




GLU~ 


"134 


73 .382019 


5 


GLN~ 


"135 


114.870010 




ALA~ 


"136 


2 . 122917 




VAL~ 


137 


1.074774 




ASN 


"138 


55.622704 




SER 


"139 


29.174965 


10 


ALA 


"140 


0. 268693 




THR~ 


"141 


27.962946 




SER" 


"142 


87.263145 




ARG 


"143 


88.201218 




GLY~ 


"144 


38.477882 


15 


VAL~ 


"145 


2.079151 




LEU~ 


'146 


13 .703363 




VAL 


"147 


2. 690253 




val" 


"148 


1. 074774 




ALA~ 


"149 


0. 000000 


20 


ALA 


"150 


4 . 356600 




SER~ 


"151 


0. 000000 




GLY - 


"152 


12 . 628590 




ASN~ 


"153 


84.248703 




SER 154 


77.662354 


25 


GLY 


155 


25.409861 




ALA~ 


"156 


38.074570 




GLY 


"157 


40.493744 




SER" 


'158 


53 .915291 




ile" 


"159 


4.352278 


30 


SER - 


"160 


12 .458543 




TYR~ 


"161 


29.670284 




PRO 


"162 


4.030401 




ALA" 


"163 


0.968253 




ARG 


"164 


84. 059120 


35 


TYR 


"165 


28.641129 




ALA 


"166 


68.193314 




ASN" 


"167 


61.686481 




ALA" 


"168 


0.537387 




met - 


"169 


0. 586837 


40 


ala" 


"170 


0.000000 




VAL 


"171 


0. 000000 




GLY 


"172 


0.000000 




ALA - 


"173 


0. 933982 




THR 


'174 


3.013133 


45 


ASP 


"175 


34.551376 




GLN" 


"176 


96.873039 




ASN 


"177 


98.664368 




ASN" 


"178 


41.197159 




ASN" 


"179 


60.263512 


50 


arg" 


"180 


64.416336 




ALA" 


"181 


7.254722 




SER" 


"182 


91.590881 




PHE 


"183 


52.126518 




SER - 


"184 


2. 101459 


55 


GLN" 


"185 


15.736279 




tyr" 


"186 


44.287792 




GLY" 


"187 


5. 114592 
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ALA 


188 


69 - 406563 




GLY 189 


36 . 926083 




LEU 


190 


16.511177 




asp" 


191 


7. 705349 




ILE~ 


192 


0.268693 




VAL~ 


193 


4 . 299094 




ALA~ 


194 


0 . 000000 




PRO - 


195 


0.806080 




GLY~ 


196 


0. 000000 


i n 


VAL~ 


197 


9^ 957177 




ASN~ 


198 


O ^ • X / / *f & ^ 




VAL~ 


199 


i 0 74.77^6 




GLN~ 


200 


on. 374527 




SER~ 


201 


2 008755 


1 ^ 


THR~ 


202 


0 OOOQOO 




tyr" 


203 


80 . 679886 




pro" 


204 


34 . 632195 




GLY~ 


205 


74 . 536827 




SER~ 


206 


74 .964920 


20 


THR~ 


207 


57 . 070065 




TYR~ 


"2 08 


82 895500 




ALA~ 


209 


00 R18Q40 




SER~ 


210 


69 045639 




LEU~ 


211 




OK 


ASN~ 


212 






GLY~ 


213 


^ * tJO V .J ** 




thr" 


214 


*X • \J \J ^ ^ \J J 




SER~ 


215 


15 225292 




MET - 


"2 16 


7 . 261287 


30 


ALA~ 


"217 


0. 000000 




THR~ 


218 


0. 000000 




pro" 


219 


0. 806080 




his" 


220 


2 . 662697 




VAL~ 


"221 


0 . 268693 


35 


ALA~ 


222 


0. 000000 




GLY~ 


223 


0 . 000000 




ALA~ 


224 


7 - 206634 




ALA~ 


225 


1 . 039576 




ALA~ 


226 


0 268693 




LEU - 


227 


1 074774 




val" 


"228 


1 ^41764 




LYS - 


"229 


39 262505 




GLN~ 


"230 


54 .501614 




LYS~ 


"231 


81 . 154129 


45 


asn" 


"232 


30 . 004124 




pro" 


"233 


91 . 917931 




ser" 


"234 


102 . 856705 




TRP" 


"235 


64 . 639481 




SER" 


"236 


51.797619 


50 


asn" 


"237 


24.866917 




VAL 238 


78.458466 




GLN 239 


73.981461 




ILE 


240 


14.474245 




arg" 


"241 


41.242931 


55 


asn" 


"242 


64.644814 




his" 


"243 


50.671440 




leu" 


~244 


5.127482 
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61 



LYS 


245 


48 . 820000 


ASN_ 


246 


115 . 264534 


TMK_ 


24 / 


22 • 2U5 J /o 


H T JV 

AJjA_ 


J 4 o 


i.O.4 iSU / / 


TrlK_ 




/- /-j K /"V O 1 n 1 

ou . dUjXOI 








T 1?TT~ 














"OKI 


IS. 011491 


XIlK 


2 34 


OO . 42 170£ 


Ik O XT 

AS>N_ 


255 


/2 . 490181 


LEU 


O C £T 

256 


54 . 835758 


TYR_ 


25 / 


38 . 798912 


GLY_ 


258 


3 . 620916 


SER_ 


259 


35 . 017368 


GLY_ 


260 


0 . 537387 


JjEU_ 


2 61 


o • 598188 


Y7JI T 

VAIj_ 


_262 


4 • 019 700 


ASN 


263 


16 . 763659 


ALA" 


"264 


3.413124 


GLU" 


"265 


37.942276 


ALA 


"266 


15.871746 


ALA 


"267 


3.947115 


thr" 


"268 


2.475746 


arg" 


"269 


176.743362 


ION 


"270 


0.000000 


ion" 


"271 


5.197493 



Subset REST: 

restmole. list 
30 Subset REST: 

SAVI8 : E5-E15 , E17-E18 , E22 , E38-E40 , E42-E43 , E73-E76 , E82-E86 , E103- 
E105, 

SAVI8 : E108-E109 , E111-E112 , E115-E116 , E122 , E128-E144 , E149- 

Ei50,Ei56-Ei57, 

35 SAVI8 : E160-E162 , E165-E168 , E170-E171 , E173 , E180-E188 , E190- 
E192,E200, 

SAVI8 : E203-E204 , E206 , E211-E213 , E215-E216 , E227-E230 , E255- 
E259,E261-E262, 
SAVI8:E267-E269 
40 restatom. list 
Subset REST: 

SAVI8:PR0 E5:N,CD,CA,CG,CB,C,0 

SAVI8 : TRP E6 : N , CA , CO 2 , CE2 , NE1 , CD1 , CG , CE3 , CZ3 , CH2 , CZ2 , CB , C , O 
SAVI8:GLY E7:N,CA,C,0 
45 SAVI8:ILE E8:N,CA,CD1,CG1,CB,CG2 ,C,0 
SAVI8:SER E9:N,CA,0G,CB,C,0 

SAVI8:ARG E10:N,CA,NH2,NH1 > CZ,NE,CD,CG,CB / C,0 
SAVI8:VAL Ell :N,CA,CG2 ,001,06,0,0 
SAVI8:GLN E12 :N,CA,NE2 ,0E1,CD,CG,CB,C,0 
50 SAVI8:ALA E13 : N, CA, CB, C, 0 

SAVI8:PR0 E14:N,CD,CA,CG,CB,C,0 
SAVI8:ALA E15:N,CA,CB, C,0 

SAVI8:HIS E17:N,CA,CD2 ,NE2 , CE1 , ND1 , CG , CB, C, 0 
SAVI8:ASN E18 :N, CA,ND2 ,0D1,CG, CB,C, O 
55 SAVI8:THR E22 :N, CA, CG2 ,0G1,CB, C, 0 
SAVI8:THR E38 :N,CA, CG2 ,OGl,CB, 0,0 
SAVI8:HIS E39:N,CA,CD2,NE2,CEl,NDl,CG,0B,C,O 
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SAVI8 : 


PRO 


E40 :N, CD, CA, CG, CB, 0,0 




SAVI8 : 


LEU 


E42:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


ASN 


E43 :N, CA,ND2 , OD1, CG,CB,C,0 




SAVI8 : 


ALA 


E73:N,CA,CB,C,0 


5 


SAVI8: 


ALA 


E74:N,CA,CB,C,0 




SAVI8 : 


LEU 


E75:N / CA / CD2,CD1,CG,CB,C,0 




SAVI8 : 


ASN 


E76:N / CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


LEU 


E82 :N, OA, CD2 , CD1, CG,CB, 0,0 




SAVI8 : 


GLY 


E83:N,CA,C,0 


10 


SAVI8: 


VAL 


E84 :N, OA, CG2 , CGI, CB,C,0 




SAVI8: 


ALA 


E8 5 : N , CA , CB , C , O 




SAVI8 : 


PRO 


E86:N,CD,CA,CG,CB,C,0 




SAVI8: 


SER 


E103: 


N,CA,OG,CB,C,0 




SAVI8: 


VAL 


E104: 


N,CA,CG2,CG1,CB,C,0 


15 


SAVI8: 


SER 


E105: 


N,CA,0G,CB,C,0 




SAVI8: 


ALA 


E108: 


N,CA,CB,C,0 




SAVI8: 


GLN 


E109: 


N,CA,NE2,0E1,CD,CG,CB,C,0 




SAVI8 : 


LEU 


Elll: 


N,CA, 002,001,00,06,0,0 




SAVI8: 


GLU 


E112: 


N,CA,0E2,0E1,CD,CG,CB,C,0 


20 


SAVI8: 


GLY 


E115: 


N,CA,C,0 




SAVI8: 


ASN 


E116: 


N,CA,ND2,ODl,CG,0B,C,O 




SAVI8 : 


ALA 


E122: 


N,CA,CB,C,0 




SAVI8: 


SER 


E128: 


N,CA,OG,CB,C,0 




SAVI8 : 


PRO 


E129: 


N,CD,CA,CG,CB,C,0 


25 


SAVI8 : 


SER 


E130: 


N,CA,0G,CB,C,0 




SAVI8 : 


PRO 


E131: 


N , CD , CA , CG , CB , C , O 




SAVI8: 


SER 


E132: 


N,CA,OG,CB,C,0 




SAVI8 : 


ALA 


E133 : 


N,CA,CB,C,0 




SAVI8 : 


THR 


E134: 


N,CA, 002,001,06,0,0 


30 


SAVI8: 


LEU 


E135: 


N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


GLU 


E136: 


N,CA,OE2,OEl,CD,CG,CB,C,0 




SAVI8 : 


GLN 


E137 : 


N,CA,NE2,OEl,CD,CG,0B,C,O 




S AVI 8 : 


ALA 


E138: 


N,CA,CB,C,0 




SAVI8: 


VAL 


E139: 


:N,CA,CG2,CG1,CB,C,0 


35 


S AVI 8 : 


ASN 


E140: 


N,CA,ND2,ODl,CG,0B,C,O 




SAVI8: 


SER 


E141: 


N,CA,OG,CB,C,0 




SAVI8: 


ALA 


E142: 


.N,CA,CB,C,0 




SAVI8: 


THR 


E143' 


:N,CA,CG2,0G1,CB,C,0 




SAVI8: 


.SER 


E144' 


:N,CA,OG,CB,C,0 


40 


SAVI8 s 


:VAL 


E149 


:N,CA, 002,001,08,0,0. 




SAVI8: 


VAL 


E150. 


:N,CA,CG2,CG1,CB,C,0 




SAVI8 : 


. SER 


E156' 


:N,CA,OG,CB,C,0 




SAVI8 ' 


:GLY 


E157' 


:N,CA,C,0 




SAVI8' 


: ALA 


E160 


:N,CA,CB,C,0 


45 


SAVI8 


:GLY 


E161 


:N,0A,C,O 




SAVI8 . 


:SER 


E162. 


:N,0A,OG,CB,C,O 




SAVI8 . 


:ILE 


E165 


:N,CA,CD1,CG1,CB,CG2,C,0 




SAVI8- 


:SER 


E166 


:N,CA,0G,CB,C,0 




SAVI8 


:TYR 


E167 


:N,CA,OH,CZ,CD2 ,CE2 , CE1 , CD1 , CG, CB , C, O 


50 


SAVI8 


:PRO 


E168 


:N,CD,CA,CG,CB,C,0 




SAVI8 


: ARG 


E170 


:N,CA,NH2,NHl,CZ,NE,0D,CG,CB,C,O 




SAVI8 


: TYR 


E17 1 : N , CA , OH, CZ , CD2 , CE2 , CE1 , CD1 , CG , CB , C, O 




SAVI8 


:ASN 


E173 :N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 


:THR 


E180:N,CA,CG2,OGl,CB,C,O 


55 


SAVI8 


:ASP 


E181:N,CA,OD2,ODl,CG,CB,C,0 




SAVI8 


:GLN 


E182 :N,CA,NE2 , OEl , CD, CG, CB, C , O 




SAVI8 


:ASN 


E183 :N,CA,ND2,0D1,CG,CB,C,0 
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35 



40 



45 



50 



55 



.,ND2,0D1,CG,CB,C,0 
.,ND2 , OD1 , CG , CB , C , O 
.,NH2,NH1,CZ,NE, 00,06,08,0,0 
1,08,0,0 
l,OG,CB,C,0 
l,OG, 0B,C,O 

.,NE2,0E1,CD,CG,CB,C,0 
.,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 
.,CB,0,O 

l,CG2,CG1,CB,C,0 
.,ND2,0D1,CG,CB,C,0 
.,NE2 , OE1 , CD, CG, CB , C, O 
/C,0 

.,OG,0B,C,O 
l,CG2,0G1,CB,C,0 
i,CB, 0,0 
.,OG,CB,C,0 
.,CG2,CG1,CB,C,0 
.,CB,C,0 

.,c,o 

.,08,0,0 

l,CG2,0G1,CB,C,0 
k ,OG,CB,0,O 
.,CD2,CD1,CG,CB,C,0 
.,0,0 

.,OG,CB,C,0 
.,ND2,ODl,CG,CB,C,0 
,,002,001,00,06,0,0 
,,CD2,CD1,CG,CB,C,0 
,,CG2,CG1,CB,C,0 
,,ND2,ODl,CG,CB,C,0 
Subset SUB5B: 

sub5bmole. list 
Subset SUB5B: 

SAVI8 : E2-E4 , E16 , E19-E21, E23-E24 , E28 , E37 , E4 1 , E44-E45 , 
E77-E81,E87-E88, 

SAVI8 : E90 , E113-E114 , E117-E118 , E120-E121 , E145- 

E148 , E169, E172 , E174-E176 , 

SAVI8:E193-E196,E198-E199,E214,E231- 

E234,E236,E243,E247,E250,E253-E254 t 

SAVI8 : E260 , E263-E266 , E270-E273 ,M276H-M277H 

sub5batom. list 
Subset SUB5B: 

SAVI8:GLN E2:N,CA,NE2,0El,CD,CG,CB,C,O 
SAVI8:SER E3:N,CA,0G,CB,C,0 
SAVI8:VAL E4:N,CA,CG2,CG1,CB,C,0 
SAVI8:ALA E16:N,CA,CB,C,0 

SAVI8XARG E19:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
SAVI8:GLY E20:N,CA,C,0 

SAVI8:LEU E21:N, OA, CD2 ,CD1 , CG, CB, 0,0 

SAVI8:GLY E23:N,CA,C,0 

SAVI8:SER E24 :N, CA,OG, CB, C, O 

SAVI8:VAL E28 : N, OA, CG2 , CGI , CB , C, O 

SAVI8:SER E37:N, CA,OG,CB, C,0 

SAVI8 : ASP E4 1 : N , OA , OD2 , OD1 , CG , CB , C , O 

SAVI8:ILE E44:N,CA,CD1,CG1,CB,CG2,C,0 

SAVI 8 : ARG E45 : N, OA, NH2 , NH1 , CZ , NE , CD , CG , CB , C , O 





SAVI 8: 


ASN 


E184: 


N, 


OA 




SAVI 8: 


ASN 


E185: 


N, 


OA 




SAVI 8: 


ARG 


E186: 


N, 


OA 




SAVI 8: 


ALA 


E187: 


N, 


OA 


5 


SAVI 8: 


SER 


E188: 


N, 


OA 




SAVI 8 : 


SER 


E190: 


N, 


OA 




SAVI 8 : 


GLN 


E191: 


N, 


CA 




SAVI 8 : 


TYR 


E192: 


N, 


OA 




SAVI 8 : 


ALA 


E200; 


N, 


CA 


10 


SAVI 8 : 


VAL 


E203 


N, 


CA 




SAVI 8 : 


ASN 


E204 : 


N, 


CA 




SAVI 8 : 


GLN 


E206: 


N, 


CA 




SAVI 8: 


GLY 


E211 


N, 


CA 




SAVI 8 ; 


SER 


E212 


N, 


CA 


15 


SAVI 8 : 


THR 


E213 


:N, 


CA 




SAVI 8 . 


ALA 


E215. 


■N ( 


CA 




SAVI 8: 


SER 


E216 


N, 


CA 




SAVI 8' 


.VAL 


E227 


:N, 


CA 




SAVI 8 


: ALA 


E228 


;N, 


CA 


20 


SAVI 8 


.GLY 


E229 


:N 


,CA 




SAVI 8 


: ALA 


E230 


:N 


,CA 




SAVI 8 


rTHR 


E255 


:N ( 


,CA 




SAVI 8 


I SER 


E256 


:N, 


,CA 




SAVI 8 


:LEU 


E257 


:N ( 


r CA 


25 


SAVI 8 


:GLY 


E258 


:N 


r CA 




SAVI 8 


:SER 


E259 


:N 


r CA 




SAVI 8 


:ASN 


E261 


:N 


r CA 




SAVI 8 


:LEU 


E262 


:N ( 


r CA 




SAVI 8 


:LEU 


E267 


:N 


r CA 


30 


SAVI 8 


:VAL 


E268 


:N 


,CA 




SAVI 8 


:ASN 


E269 


:N 


r CA 
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SAVI8 
SAVI8 
SAVI8 
SAVI8 
5 SAVI8 
SAVI8 
SAVI8 
SAVI8 



ASN E77:N,CA,ND2,0D1,CG,CB,C,0 
SER E78:N,CA,0G,CB,C,0 
ILE E79 :N,CA, CD1 , CGI, CB, CG2 , C,0 
GLY E80:N,CA,C,0 
VAL E81:N,CA, 0G2,CGl,CB,C,O 
SER E87:N,CA, OG,CB,C,0 
ALA E88:N,CA,CB,C,0 
LEU E90:N,CA,CD2,CD1, CG,CB,C,0 





*>AVT8 ■ 

un« X O « 


TRP 


E113 : 


:N, 


CA, 


CD2,CE2,NE1,CD1,CG,CE3 , CZ3 , 


CH2,CZ2,0B,C,O 


t n 

JL\J 




AT. A 

AX1A 


PI 1 4 ' 




PA 


CB C O 






c 2V\7T ft « 


noli 


PI 1 7 ' 




PA 

V»A ( 


ND2 OD1 CG CB . C - O 






DnV J.O < 


f!T.V 




M 


PA 

v» a. , 


c o 






onVio « 


nX o 


PI ?fl ' 




CA 


CD2 NE2 CE1 . ND1 - CG . CB .CO 






OA VIO < 


VAT. 

V 4v\XJ 


XlX & X * 


• N 


CA. 


CG2 . CGI - CB . C . 0 




i 

-L «^ 


OA V J. O i 


ARG 


E145 : 


!N, 


CA, 


NH2 . NH1 . CZ . NE . CD . CG , CB , C , 0 






SAVI8 : 


GLY 


E146 : 


:N, 


CA, 


C,0 






RAVT8 ' 


VAL 


E147 


:N, 


CA, 


CG2 . CGI - CB .CO 






SAVT8 * 


LEU 


E148 : 


:N, 


CA, 


CD2 . CD1 . CG . CB . C . 0 






CAVT8 * 

OA V x o * 


AT. A 




■ N 


CA, 


CB CO 






Onvlo 


AT. A 


Hi X / 


1 W 


PA 


PR P O 






c;a\7Tft < 


AT. A 


PI 7 4 

Xt X / *T 


» IN , 


PA 


PB C O 






c;a\7Tft ■ 

OA V X o < 


MPT 


FT 7^ • 

Xj X / U 


M 


PA 


CP SD CG CB c . o 






OA V X O 


AT. A 




k IN 


CA 


CB C O 






OA VIO i 


GT.Y 




k IN { 


CA 


C , 0 






OA V X O 4 


ATiA 




N 

k IN ^ 


PA 


CB C O 






U/l V -1- o 


GLY 


E195 


:N 


CA 


c,o 






SAVI8 : 


LEU 


E196 ' 


:N, 


CA, 


CD2 , CD1 , CG , CB , C , O 






SAVI8 : 


ILE 


E198 : 


:N, 


CA, 


CD 1 , CG 1 , CB , CG2 , C , O 






SAVI8 


VAL 


E199 . 


:N, 


CA, 


CG2 / CGl,CB,C / 0 




30 


SAVI8 : 


TYR 


E214 ' 


:N, 


CA, 


OH, CZ , CD2 , CE2 , CE1 , CD1 , CG , CB 


,c,o 




SAVI8 : 


ALA 


E231: 


:N, 


CA, 


CB.C.O 






SAVI8 


.ALA 


E232 ' 


:N, 


CA, 


CB,C,0 






SAVI8 


LEU 


E233 


:N, 


CA, 


CD2,CD1,CG,CB,C,0 






SAVI8 ' 


VAL 


E234' 


:N, 


CA, 


CG2,CG1,CB,C,0 




35 


SAVI8 


: GLN 


E236 


:N, 


CA, 


NE2,0El,CD,CG,CB,C / 0 






SAVI8 ' 


.ASN 


E243 


:N, 


CA, 


ND2 / ODl,CG,CB,C,0 






SAVI8 


'ARG 


E247 


:N, 


CA, 


NH2 # NH1,CZ,NE,CD,CG,CB,C,0 






SAVI8 


:LEU 


E250 


:N, 


CA, 


002,001,00,06,0,0 






SAVI8 


:THR 


E253 


:N, 


CA, 


CG2 # 0G1,CB,C,0 




40 


SAVI8 


1 ALA 


E254 


:N, 


CA, 


CB,C,0 






SAVI8 


:THR 


E260 


:N, 


CA, 


CG2,0G1,CB,C,0 






SAVI8 


:TYR 


E263 


:N, 


CA, 


OH , CZ , CD2 , CE2 , CE1 , CD1 , CG , CB 


,c,o 




SAVI8 


:GLY 


E264 


:N, 


CA, 


C,0 






SAVI8 


:SER 


E265 


:N, 


CA, 


OG,CB,C,0 




45 


SAVI8 


: GLY 


E266 


:N 


CA, 


0,0 






SAVI8 


: ALA 


E270 


:N, 


CA, 


CB,C,0 






SAVI8 


:GLU 


E271 


:N, 


,CA, 


0E2 / OEl,CD,0G,CB,C,O 






SAVI8 


: ALA 


E272 


:N 


CA 


CB,C,0 






SAVI8 


: ALA 


E273 


:N 


CA 


CB,C,0 





50 SAVI8:ION M276H:CA 
SAVI8:I0N M277H:CA 
Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

55 SAVI8 : E29-E35 , E48-E51 , E54 , E58-E72 , E91-E102 , E106-E107 , E110 , E123- 
E127, 
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SAVI8 : E151-E155, E177-E179 , E189 , E201-E202 , E205, E207-E210 , E217- 
E226 

actsiteatom. list 
5 Subset ACTSITE: 





SAVI8 : 


ALA 


E29: 


N,CA,CB,C,0 






SAVI8 : 


VAL 


E30: 


N, CA, CG2 , CGI , CB, C , 0 






SAVI8 : 


LEU 


E31: 


N,CA, 002,001, 00,06,0,0 






SAVI8 : 


ASP 


E32; 


N,CA,OD2,ODl,CG,C5,C, O 




10 


SAVI8: 


THR 


E33: 


N,CA,CG2,OGl,CB,C,0 






SAVI8: 


GLY 


E34: 


N,CA,C,0 






SAVI8 : 


ILE 


E35: 


N,CA,CD1,CG1,CB,CG2,C,0 






SAVI8 : 


ALA 


E48: 


N,CA>CB,C,0 






SAVI8 : 


SER 


E49: 


N,CA,0G,CB,C,0 




15 


SAVI8 : 


PHE 


E50: 


N,CA,CD2,CE2,CZ,CE1,CD1,CG, 


CB,C,0 




SAVI8 : 


VAL 


E51: 


N, CA, CG2 , CGI , C6, C, 0 






SAVI8 : 


GLU 


E54: 


N,CA,OE2,OEl,CD,CG,CB,C,0 






SAVI8 : 


THR 


E58: 


N,CA,CG2,0G1,CB,C,0 






SAVI8: 


GLN 


E59: 


N,CA,NE2,OEl,CD,CG,CB,C,0 




20 


SAVI8 : 


ASP 


E60: 


N,CA,0D2,0D1,CG,CB,C,0 






SAVI8 : 


GLY 


E61: 


N,CA,C,0 






SAVI8 : 


ASN 


E62: 


N,CA,ND2,0D1,CG,CB,C,0 






SAVI8 : 


GLY 


E63 : 


N,CA,C,0 






SAVI8 : 


HIS 


E64: 


N,CA,CD2,NE2,CE1,ND1,CG,CB, 


C,0 


25 


SAVI8. 


GLY 


E65: 


N,CA,C,0 






SAVI8 : 


THR 


E66: 


N,CA,CG2,0G1,CB,C,0 






SAVI8 « 


HIS 


E67 


N,CA,CD2,NE2,CEl f ND1,CG,CB, 


c,o 




SAVI8 


.VAL 


E68 


N,CA, 002,001,06,0,0 






SAVI8 « 


ALA 


E69" 


N,CA,CB,0,O 




30 


SAVI8 


GLY 


E70 


N,CA,C,0 






SAVI8 


•THR 


E71 


.N,CA,CG2,OG1,C8,C,0 






SAVI8 


. ILE 


E72 


N,CA,CD1,CG1,C5,CG2,C,0 






SAVI8 


: TYR 


E91 


.N,CA,0H,CZ,CD2,CE2,CE1,CD1, 


CG , CB , C , O 




SAVI8 


: ALA 


E92 


:N,CA,CB,C,0 




35 


SAVI8 


:VAL 


E93 


N,CA,CG2,CG1,CB,C,0 






SAVI8 


:LYS 


E94 


:N,CA,NZ,CE,CD,CG,CB,C,0 






SAVI8 


:VAL 


E95 


:N,CA,CG2,CG1,CE,C,0 






SAVI8 


:LEU 


E96 


.N,CA,CD2,CD1,CG,CB,C,0 






SAVI8 


: GLY 


E97 


:N,CA,C,0 




40 


SAVI8 


: ALA 


E98 


:N,CA,C6,C,0 






SAVI8 


:SER 


E99 


:N,CA,0G,C6,C,0 






SAVI8 


:GLY 


E100:N,CA,C,O 






SAVI8 


:SER 


E101:N,CA, OG,CB,C,0 






SAVI8 


:GLY 


E102:N,CA,C,0 




45 


SAVI8 


:SER 


E106:N,CA, OG,CB,C,0 






SAVI8 


:ILE 


E107:N,CA,CD1, CGI ,CB,CG2, 0,0 






SAVI8 


: GLY 


E110:N,CA,C,O 






SAVI8 


:ASN 


E123 : N , CA , ND2 , OD1 , CG , CB , C , 0 






SAVI8 


:LEU 


E124:N,CA,CD2,CD1,CG,CB,C,0 




50 


SAVI8 


:SER 


E125:N,CA,OG,C8,C,0 






SAVI8 


:LEU 


E126:N,CA, 002,001, 06,06,0,0 






SAVI8 


: GLY 


E127:N,CA,C,0 






SAVI8 


: ALA 


E151:N, CA,CB,C,0 






SAVI8 


: ALA 


E152:N,CA,CB,C,0 




55 


SAVI8 


:SER 


E153:N,CA,0G,CB,C,0 






SAVI8 


: GLY 


E154:N,CA,C,0 






SAVI8 


:ASN 


E155:N,CA,ND2,0D1,CG,CB,C,0 





- WO 98/35026 



66 



PCT7DK98/00046 



SAVI8 


:VAL 


E177 


N, CA, CG2 , CGI, C8, C, 0 




SAVI8 : 


:GLY 


E178. 


N,CA,C,0 




SAVI8 ' 


: ALA 


E179' 


N,CA,CB,0,O 




SAVI8 


:PHE 


E189 


:N,CA,CD2,CE2,CZ, CE1,CD1,CG, 


CB , C , O 


SAVI8 ' 


:PRO 


E201' 


N, CD, CA, CG, CB,C,0 




SAVI8 


:GLY 


E202 


N,CA,C,0 




SAVI8 


:VAL 


E205 


:N,CA, 002,061, 06,0,0 




SAVI8 ' 


:SER 


E207: 


N,CA,OG,CB,C,0 




SAVI8 


:THR 


E208- 


N,CA, 002,001,06,0,0 




SAVI8 


:T¥R 


E209 


:N,CA,0H,CZ,CD2,CE2,CE1,CD1, 


CG,CB, C,0 


SAVI8 


:PRO 


E210. 


N , CD , CA , CG , CB , C , O 




SAVI8 


:LEU 


E217 


N , CA , CD2 , CD1 , CG , CB , C , O 




SAVI8 


:ASN 


E218 


N,CA / ND2,0D1,CG,CB,C,0 




SAVI8 


:GLY 


E219: 


N,CA,C,0 




SAVI8 


:THR 


E220 


N,CA,CG2,0G1,CB,C,0 




SAVI8 


:SER 


E221 


N , CA , OG , CB , C , O 




SAVI8 


:MET 


E222 


N,CA,CE, 80,00,06,0,0 




SAVI8 


: ALA 


E223 


N,CA,CB,C,0 




SAVI8 


: THR 


E224 


N,CA, 002,001,08,0,0 




SAVI8 


:PRO 


E225 


N,CD,CA, 00,06,0,0 




SAVI8 


:HIS 


E226 


N,CA,CD2,NE2,CE1 / ND1,CG,CB, 


C,0 


Subset RESTx: 








restxmole. list 






Subset RESTX: 









25 NEWMODEL: E5 , E13-E14 , E22 , E38-E40 , 

E42 ,E73-E76,E82-E86,E103-E105, 
NEWMODEL: E108 , E122 , E133-E135 , E137-E140 , 

E149-E150,E173,E204,E2 06, 
NEWMODEL: E211-E213 , E215-E216 , E227- E229, 
30 E258,E269 





restxatora. 


list 




Subset RESTX: 








NEWMODEL: 


PRO 


E5:N,CD,CA,CG, CB,C,0 




NEWMODEL: 


ALA 


E13:N,CA,CB,C,0 


35 


NEWMODEL: 


PRO 


E14:N,CD,CA,CG,CB,C,0 




NEWMODEL: 


THR 


E22:N,CA,CG2,0G1,CB,C,0 




NEWMODEL: 


THR 


E38:N,CA,CG2,0G1,CB,C,0 




NEWMODEL: 


HIS 


E39:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 




NEWMODEL: 


PRO 


E40:N,CD / CA,CG,C8,C,0 


40 


NEWMODEL: 


LEU 


E42:N,CA,CD2,CD1,CG,CB,C,0 




NEWMODEL: 


ALA 


E73:N,CA,CB,C,0 




NEWMODEL: 


ALA 


E74:N,CA,CB,C,0 




NEWMODEL: 


LEU 


E75:N,CA,CD2,CD1,CG,CB,C,0 




NEWMODEL. 


ASN 


E76:N,CA,ND2,0D1,CG,CB,C,0 


45 


NEWMODEL. 


LEU 


E82:N,CA,CD2,CDl,CG,0B,C,O 




NEWMODEL 


•GLY 


E83:N,CA,C,0 




NEWMODEL 


:VAL 


E84 : N , CA , CG2 , CGI , CB , C , 0 




NEWMODEL 


: ALA 


E85:N,CA,CB,C,0 




NEWMODEL 


:PRO 


E86 : N, CD, CA, 00,06,0,0 


50 


NEWMODEL 


:SER 


E103:N,CA,OG,CB,C,0 




NEWMODEL 


:VAL 


E104:N,CA, 002,001,08,0,0 




NEWMODEL 


:SER 


E105:N, CA,OG,CB,C,0 




NEWMODEL 


: ALA 


E108:N,CA,CB,C,O 




NEWMODEL 


: ALA 


E122:N,CA,CB,C,0 


55 


NEWMODEL 


: ALA 


E133:N,CA,CB,C,0 




NEWMODEL 


: THR 


E134:N / CA,CG2,0Gl,CB,C,O 




NEWMODEL 


:LEU 


E135:N,CA,CD2,CDl,0G,C8,C,O 
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10 



15 



20 



NEWMODEL : GLN E137 
NEWMODEL : ALA E138 
NEWMODEL : VAL E139 
NEWMODEL :ASN E140 
NEWMODEL: VAL E149 
NEWMODEL: VAL El 50 
NEWMODEL :ASN E173 
NEWMODEL :ASN E204 
NEWMODEL: GLN E206 
NEWMODEL : GLY E211 
NEWMODEL : SER E212 
NEWMODEL : THR E213 
NEWMODEL: ALA E215 
NEWMODEL: SER E216 
NEWMODEL: VAL E227 
NEWMODEL: ALA E228 
NEWMODEL: GLY E229 
NEWMODEL: GLY E258 
NEWMODEL :ASN E269 



67 

N,CA,NE2,0E1,CD,CG,CB,C,0 

N,CA,CB,C,0 

N, CA,CG2,CG1,CB,C,0 

N / CA / ND2 / 001,06,08,0,0 

N, CA, CG2 , CGI , CB , C, O 

N, CA, CG2 , CGI, CB , C, O 

N,CA,ND2,0D1,CG,CB,C,0 

N,CA,ND2,0D1,CG,CB,C,0 

N,CA,NE2,0E1,CD,CG,CB,C,0 

N,CA,C,0 

N,CA,0G,CB,C,0 

N,CA,CG2,OGl,CB,C,0 

N,CA,CB,C,0 

N,CA,OG,CB,C,0 

N,CA,CG2,CG1,CB,C,0 

N,CA,CB,C,0 

N,CA,C,0 

N,CA,C,0 

N, CA, ND2 , ODl, CG,CB, C, O 



Example 3 

Suitable substitutions in PD498 for addition of carboxvlic acid 
attachment groups f-COOH^ 

The 3D structure of PD498 was modeled as described in 
25 Example 1. 

Suitable locations for addition of carboxylic attachment groups 
(Aspartatic acids and Glutamic acids) were found as follows. 
The procedure described in Example 1 was followed- The 
commands performed in Insight (BIOSYM) are shown in the command 
30 files makeDEzone.bcl and makeDEzone2 .bcl below: 



Conservative substutitions: 

makeDEzone . bcl 

Delete Subset * 
35 Color Molecule Atoms * Specified Specification 255,0,255 

Zone Subset ASP :asp:od* Static monomer /residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

40 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERM :280:O Static monomer/residue 10 color_Subset 
255,255,0 

#N0TE: editnextline ACTSITE residues according to the protein 
45 Zone Subset ACTSITE : 39, 72, 226 Static monomer /residue 8 

Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 

Combine Subset ALLZONE Union ALLZONE CTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 
50 #NOTE: editnextline object name according to the protein 

Combine Subset REST Difference PD4 9 8 FINALMODEL ALLZONE 
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List Subset REST Atom Output File restatom. list 
List Subset REST monomer/ resxdue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output_File actsitea torn. list 
5 List Subset ACTSITE monomer/ residue Output_File 
actsitemole . list 
# 

Zone Subset REST5A REST Static Monomer /Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 

10 Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File sub5batom. list 
List Subset SUB5B monomer/ residue Output_File subSbmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 

15 ... 

#continue with makezone2.bcl. 

#Use grep command to identify asn/gln in restatom. list ... 
#sub5batom. list & accsitea torn. list 

20 Comments: 

The subset REST contains Gln33 and Asn245, SUB5B contains 
Glnl2, Glnl26, Asn209, Gln242, Asn246, Gln248 and Asn266, all 
of which are solvent exposed. 

The substitutions Q12E or Q12D, Q33E or Q33D, Q126E or 
25 Q126D, N209D or N209E, Q242E or Q242D, N245D or N245E, N246D or 
N246E, Q248E or Q248D and N266D or N266E are identified in 
PD498 as sites for mutagenesis within the scope of this 
invention. Residues are substituted below in section 2, and 
further analysis done: 



Non-conservative substitutions : 
makeDEzone2 . bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for . . . 

#asn->asp & gln->glu substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace PD 4 9 8 FINALMODEL newmodel 
40 Biopolymer 

#N0TE: editnextline object name according to protein 
Blank Object On PD498FINALMODEL 

#NOTE: editnextlines with asn->asp & gln->glu positions 

Replace Residue newmodel: 33 glu L 
45 Replace Residue newmodel: 245 asp L 

Replace Residue newmodel: 12 glu L 

Replace Residue newmodel: 126 glu L 

Replace Residue newmodel: 209 asp L 

Replace Residue newmodel: 242 glu L 
50 Replace Residue newmodel: 246 asp L 

Replace Residue newmodel: 248 glu L 
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Replace Residue newmodel: 266 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu / . . . 
#now including introduced asp & glu 
5 Color Molecule Atoms newmodel Specified Specification 255,0,255 
Zone Subset ASPx newmodel : asp : od* Static monomer /residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel: glu :oe* Static monomer /residue 10 
Color_Subset 255,255,0 
10 #N0TE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel: 280:0 Static monomer /residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
15 Zone Subset ACTSITEx newmodel : 39 , 72 , 226 Static monomer/ residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
20 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output_File restxatom. list 

List Subset RESTx monomer /residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
25 List Subset ACTSITEx Atom Output File acts itexatom. list 
List Subset ACTSITEx monomer/resTdue Output_File 
actsitexmole. list 
# 

#read restxatom. list or restxmole. list to identify sites for 
30 (not_gluasp) ->gluasp ... 
#subst. if needed 

Comments : 

The subset RESTx contains only two residues: A233 and G234, 

35 none of which are solvent exposed. No further mutagenesis is 

required to obtain complete protection of the surface. 

However, it may be necessary to remove some of the reactive 

carboxylic groups in the active site region to ensure access to 

the active site of PD498. Acidic residues within the subset 

40 ACTSITE are: D39, D58, D68 and D106. Of these only the two 

latter are solvent exposed and D39 is a functional residue. The 

mutations D68N, D68Q, D106N and D106Q were found suitable 

according to the present invention. 

Relevant data for Example 3: 

45 Solvent accessibility data for PD498MODEL: see Example 1 above. 

Subset REST: 

restmole. list 
Subset REST: 

PD498FINALMODEL: 10-11, 33-35, 54-55, 129-130, 
50 221,233-234,236,240,243, 

PD498FINALMODEL: 245 ,2 62, 264-265 
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restatom. list 



Subset REST: 

PD498 FINALMODEL : ALA 1 0 : N , CA , C , O , CB 
5 PD4 9 8 FINALMODEL :TYR 1 1 : N , CA , C , O , CB , CG , CD1 , CD2 ,CE1, CE2 , CZ , OH 
PD4 98 FINALMODEL : GLN 3 3 : N , CA , C , O , CB , CG , CD , OE1 , NE2 
PD4 9 8 FINALMODEL : THR 3 4 : N , CA , C , O , CB , OG1 , CG2 
PD49 8 FINALMODEL : VAL 3 5 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : ILE 54 : N , CA, C , O , CB , CGI , CG2 , CD1 
10 PD4 9 8 FINALMODEL :LYS 55 :N,CA, C,0,CB,CG,CD, CE,NZ 
PD4 9 8 FINALMODEL :LYS 129 :N, CA, C, O, CB, CG, CD, CE, NZ 
PD4 9 8 FINALMODEL: VAL 130 :N, CA, C, O, CB, CGI , CG2 

PD4 9 8 FINALMODEL : TYR 2 2 1 : N , CA , C , O , CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL: ALA 233 :N,CA,C,0,CB 
15 PD4 9 8 FINALMODEL :GLY 234:N,CA,C,0 

PD4 9 8 FINALMODEL: ALA 236 :N, CA, C, 0, CB 

PD4 9 8 FINALMODEL: ALA 240 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL :GLY 243:N,CA,C,0 

PD4 9 8 FINALMODEL :ASN 245 :N, CA, C, O, CB, CG,ODl ,ND2 
20 PD4 9 8 FINALMODEL :GLY 262:N,CA,C,0 

PD4 9 8 FINALMODEL :GLY 264:N,CA,C,0 

PD4 9 8 FINALMODEL : THR 2 6 5 : N , CA , C , O , CB , OG1 , CG2 
Subset SUB5B: 
sub5bmole. list 
25 Subset SUB5B: 

PD4 9 8 FINALMODEL: 6-9 , 12-13 ,31-32 , 51-53 , 56 , 81, 93-94 , 97- 

99,122,126-128, 

PD4 9 8 FINALMODEL: 131 , 155-157 , 159 , 197-199 , 209 , 211 , 219- 
220,232,235, 

30 PD4 9 8 FINALMODEL: 237-239, 241-242, 244, 246-249, 253,260- 
261,263,266-268 
sub5batom. list 

Subset SUB5B: 



PD4 9 8 FINALMODEL 

35 PD4 9 8 FINALMODEL 
PD4 9 8 FINALMODEL 
PD4 9 8 FINALMODEL 
PD4 9 8 FINALMODEL 
PD498 FINALMODEL 

40 PD498FINALMODEL 
PD498FINALMODEL 
PD4 9 8 FINALMODEL 
PD498 FINALMODEL 
PD4 9 8 FINALMODEL 

45 PD4 9 8 FINALMODEL 
PD498 FINALMODEL 
PD498 FINALMODEL 
PD4 9 8 FINALMODEL 
PD498 FINALMODEL 

50 PD4 9 8 FINALMODEL 
PD498FINALMODEL 
PD4 9 8 FINALMODEL 
PD498 FINALMODEL 
PD4 9 8 FINALMODEL 

55 PD4 9 8 FINALMODEL 
PD4 9 8 FINALMODEL 
PD498FINALMODEL 



PRO 6:N,CA,CD,C,0,CB,CG 

TYR 7:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

TYR 8 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

SER 9:N,CA,C,O f CB,OG 

GLN 12:N,CA,C,0,CB,CG,CD,0E1,NE2 

TYR 13:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

SER 31:N,CA,C,0,CB,OG 

THR 32:N,CA,C,0,CB,0G1,CG2 

ARG 51:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

LYS 52:N,CA,C,0,CB, CG,CD,CE,NZ 

VAL 53:N,CA,C,0,CB,CG1,CG2 

GLY 56:N,CA,C,0 

ALA 81:N,CA,C,0,CB 

MET 93:N,CA,C,0,CB,CG,SD,CE 

ALA 94:N,CA,C,0,CB 

THR 97:N,CA,C,0,CB,OG1,CG2 

LYS 98:N,CA,C,0,CB,CG,CD, CE,NZ 

ILE 99:N,CA, 0,0,06,001,002,001 

TYR 122:N,CA,C,0,CB,CG,CD1,CD2,CE1 / CE2,CZ,0H 

GLN 126:N,CA,C,0,CB,CG,CD,OEl,NE2 

GLY 127:N,CA,C,0 

ALA 128:N,CA,C,0,CB 

LEU 131:N,CA,C,0,CB,CG,CD1,CD2 

GLY 155:N,CA,C,0 
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:,0,CB 

:,0,CB,CG1,CG2 
:,0,CB,CG1,CG2 
O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

:,o 

:,0,CB,0G1,CG2 
:,O,0B,CG,ODl,ND2 

:,o,cb 

:,0,c8,cg,cd1,cd2,ce1,ce2,cz,0h 

: # 0,CB,0G 
0,06,001,002 

:,0,cb,cg,cd1,cd2 
:,o,cb 

1,0,06,00,001,002 

:,o,cb,cg,cdi,cd2 
:,o,C8,og 

:, 0 , cb , cg , cd , 0e1 , ne2 
:,o,cb,cg,cd, ce,nz 
:,o,cb,cg,odi,nd2 
20 pd498finalm0del : val 247 : n, oa, 0,0, cb ,cg1 , cg2 

:,0,CB,CG,CD,OEl,NE2 
:,0,CB,CG1,CG2,CD1 
: , O , CB , CGI , CG2 , CD 1 
1,0,06,001,002,001 
25 PD498FINALM0DEL: SER 261 : N, CA, C, O, CB, OG 

:,0,CB,OGl,CG2 
:,0,CB,CG,ODl,ND2 
:,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
:,0,CB,CG,CD,CE,NZ 
3 0 Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

PD498FINALMODEL:36-42, 57-60, 66-80,100-110, 
115-116,119,132-136,160-164, 
35 PD4 9 8 FINALMODEL : 182-184 , 194 , 206-207 , 210 , 

212-215,222-231 
actsiteatom. list 
Subset ACTSITE: 

PD4 9 8 F INALMODEL : ALA 36:N,CA,C,0,CB 
40 PD4 9 8 FINALMODEL: VAL 37 :N, CA, C, O, CB, CGI , CG2 

PD4 9 8 FINALMODEL: LEU 38 :N, CA, C,0 , CB, CG, CD1 , CD2 
PD4 9 8 FINALMODEL: ASP 39 : N, CA, C, O ,CB, CG, OD1 ,0D2 
PD4 9 8 FINALMODEL: SER 40 :N, CA, C, O, CB, OG 
PD4 9 8 FINALMODEL :GLY 41:N,CA,C,0 
45 PD4 9 8 FINALMODEL: VAL 42 : N, CA, 0,0, CB, CGI , CG2 

PD498 FINALMODEL : TYR 

57:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL: ASP 58 : N, CA, 0,0, CB, CG,ODl ,OD2 
PD498 FINALMODEL : PHE 
50 59:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD4 9 8 FINALMODEL : ILE 6 0 : N , CA , C , O , CB , CG 1 , CG2 , CD1 
PD4 9 8 FINALMODEL: PRO 66 : N, CA, CD, C , 0,OB , CG 
PD4 9 8 FINALMODEL: MET 67 : N, CA, C,0, CB, CG, SD, CE 
PD4 9 8 FINALMODEL : ASP 68 : N , CA , C , O , CB , CG , ODl , OD2 
55 PD4 9 8 FINALMODEL: LEU 69 : N, CA, C, O, CB, CG, CD1 , CD2 

PD4 9 8 FINALMODEL : ASN 7 0 : N , CA , C , O , CB , CG , ODl , ND2 
PD4 9 8 FINALMODEL :GLY 71:N,CA,C,0 
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PD4 9 8 FINALMODEL : HIS 
PD4 9 8FINALMODEL : GLY 
PD4 9 8FINALMODEL : THR 
PD49 8FINALMODEL : HIS 
5 PD4 9 8 FINALMODEL : VAL 

PD49 8 FINALMODEL : ALA 
PD498 FINALMODEL : GLY 
PD4 9 8FINALMODEL : THR 
PD49 8 FINALMODEL : VAL 

10 PD4 9 8 FINALMODEL: LEU 

PD4 9 8 FINALMODEL: ALA 
PD4 9 8 FINALMODEL : VAL 
PD498 FINALMODEL : ARG 
CG,CD,NE,CZ,NH1 

15 PD4 9 8 FINALMODEL: VAL 

PD498 FINALMODEL : LEU 
PD4 9 8 FINALMODEL : ASP 
PD498 FINALMODEL : ALA 
PD498 FINALMODEL : ASN 

20 PD4 9 8 FINALMODEL: GLY 

PD4 9 8 FINALMODEL: SER 
PD4 9 8 FINALMODEL: SER 
PD4 9 8 FINALMODEL: ILE 
CG1,CG2,CD1 

25 PD498 FINALMODEL : GLY 

PD498 FINALMODEL : ASN 
PD4 9 8 FINALMODEL: LEU 
PD4 9 8 FINALMODEL: SER 
PD4 9 8 F I NALMODEL : LEU 

30 PD4 9 8 FINALMODEL: GLY 

PD4 9 8 FINALMODEL : ALA 
PD498 FINALMODEL : ALA 
. PD 4 9 8 F INALMODEL : ALA 
PD4 9 8 FINALMODEL : GLY 

35 PD4 9 8 FINALMODEL: ASN 

PD49 8FINALMODEL : VAL 
PD498 FINALMODEL : GLY 
PD4 9 8 FINALMODEL : ALA 
PD4 9 8 FINALMODEL : PHE 

40 CG,CD1,CD2,CE1, 
PD4 9 8 FINALMODEL: PRO 
PD498 FINALMODEL : GLY 
PD4 9 8 FINALMODEL: ILE 
CGI , CG2 , CD1 

45 PD498FINALMODEL: SER 

PD49 8 FINALMODEL : THR 
PD498 FINALMODEL : VAL 
PD 4 9 8 FINALMODEL : PRO 
PD4 9 8 FINALMODEL : MET 

50 PD4 9 8 FINALMODEL: SER 

PD498 FINALMODEL : GLY 
PD498 FINALMODEL : THR 
PD4 9 8 FINALMODEL: SER 
PD4 9 8 FINALMODEL : MET 

55 PD4 9 8 FINALMODEL: ALA 

PD4 9 8 FINALMODEL: SER 
PD498 FINALMODEL : PRO 
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PD4 9 8 FINALMODEL : HI S 23 1 : N, CA, C, O, CB, 
CG , ND1 , CD2 , CE1 , NE2 
Subset RESTx: 

restxmole. list 
5 Subset RESTX: 

NEWMODEL : 2 3 3 -2 3 4 
restxatom. list 
Subset RESTX: 

NEWMODEL: ALA 233 :N,CA, C,0,CB 
10 NEWMODEL : GLY 234:N / CA / C / 0 

Example 4 

Suitable substitutions in the Arthromyces ramosus peroxidase 
for addition of carboxylic acid attachment groups f-COOH) 
15 Suitable locations for addition of carboxylic attachment 
groups (Aspartatic acids and Glutamic acids) in a non- 
hydrolytic enzyme, Arthromyces ramosus peroxidase were found as 
follows. 

The 3D structure of this oxido-reductase is available in the 
20 Brookhaven Databank as larp.pdb. This A. ramosus peroxidase 
contains 344 amino acid residues. The first eight residues are 
not visible in the X-ray structure: QGPGGGGG , and N143 is 
glycosylated . 

The procedure described in Example 1 was followed. 
25 The amino acid sequence of Arthromyces ramosus Peroxidase 

(E.C.I. 11. 1.7) is shown in SEQ ID NO 4. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeDEzone. bcl and makeDEzone2 . bcl below. The C- 

terminal residue is P344, the ACTSITE is defined as the heme 
30 group and the two histidines coordinating it (H56 fit H184) . 

Conservative substitutions: 

makeDEzone. bcl 

Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
35 Zone Subset ASP :asp:od* Static monomer/residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline C-terminal residue number according to the 
40 protein 

Zone Subset CTERM : 344:0 Static monomer /residue 10 Color_Subset 
255,255,0 

#N0TE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :HEM,56,184 Static monomer /residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 
Combine Subset ALLZONE Union ALLZONE CTERM 
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Combine Subset ALLZONE Union ALLZONE ACTSITE 
#NOTE: editnextline object name according to the protein 
Combine Subset REST Difference ARP ALLZONE 
List Subset REST Atom Output_File restatom. list 
5 List Subset REST monomer /residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output File acts iteatom. list 
List Subset ACTSITE monomer /residue Output_File 
actsitemole. list 
10 # 

Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
15 List Subset SUB5B Atom Output File sub5batom. list 

List Subset SUB5B monomer/ residue Output_File sub5bmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 

* • • 

#continue with raakezone2 .bcl. 
20 #Use grep command to identify asn/gln in restatom. list ... 
#sub5ba torn. list & aces iteatom. list 

Comments : 

The subset REST contains Gln70, and SUB5B contains Gln34, 
25 Asnl28, Asn303 all of which are solvent exposed. The 

substitutions Q34E or Q34D, Q70E or Q70D, N128D or N128E and 
N303D or N303E are identified in A. ramosus peroxidase as sites 
for mutagenesis. Residues are substituted below and further 
analysis done: 

3 0 

Non-conservative substitutions: 
makeDEzone2 . bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for ... 

#asn->asp & gln->glu substitutions 

/NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace ARP newraodel 
40 Biopolymer 

#N0TE: editnextline object name according to protein 
Blank Object On ARP 

#NOTE: editnextlines with asn->asp & gln->glu positions 
Replace Residue newmodel:34 glu L 
45 Replace Residue newmodel:70 glu L 
Replace Residue newmodel:128 asp L 
Replace Residue newraodel: 303 asp -L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, . . . 
50 #now including introduced asp & glu 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
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Zone Subset ASPx newmodel: asp rod* Static monomer /residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel:glu:oe* Static monomer/residue 10 
Color_Subset 255,255,0 
5 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel: 344:0 Static monomer /residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel: HEM, 56, 184 Static monomer/residue 

8 ColorjSubset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output File restxatom. list 

List Subset RESTx monomer /residue Output File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
20 List Subset ACTSITEx Atom Output_File actsitexatom. list 
List Subset ACTSITEx monomer/ residue Output_File 
actsitexmole. list 
# 

#read restxatom. list or restxmole. list to identify sites for 
25 (not_gluasp) ->gluasp . . . 
#subst. if needed 

Comments : 

The subset RESTx contains only four residues: S9, S334, G335 

30 and P336, all of which are >5% solvent exposed. The mutations 
S9D, S9E, S334D, S334E, G335D, G335E, P336D and P336E are 
proposed in A. ramosus peroxidase. Acidic residues within the 
subset ACTSITE are: E44, D57, D77, E87, E176, D179, E190, D202, 
D209, D246 and the N-terminal carboxylic acid on P344. Of these 

35 only E44, D77, E176, D179, E190, D209, D246 and the N-terminal 
carboxylic acid on P344 are solvent exposed. Suitable sites for 
mutations are E44Q, D77N, E176Q, D179N, E190Q, D209N and D246N. 
D246N and D246E are risky mutations due to D246's importance 
for binding of heme. 

40 The N-terminal 8 residues were not included in the 

calculations above, as they do not appear in the structure. 
None of these 8 residues, QGPGGGG, contain carboxylic groups. 
The following variants are proposed as possible mutations to 
enable attachment to this region: Q1E, Q1D, G2E, G2D, P3E, P3D, 

45 G4E, G4D, G5E, G5D, G6E, G6D, G7E, G7D, G8E, G8D. 
Relevant data for Example 4: 
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Solvent accessibility data for A. ramosus peroxidase (Note: 
as the first eight residues are missing in the X-ray structure, 
the residue numbers printed in the accessibility list below are 
8 lower than those used elsewhere for residue numbering. 
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82 . 698059 




ASN_28 


96 . 600883 


35 


PHE__29 


32 . 375275 




TYR_3 0 


5 . 898365 




GLN_3 1 


103 . 380585 




GLY_32 


40.042034 




SER_33 


46.789322 


40 


LYS_34 


87 . 161873 




CYS 35 


12 . 827215 




GLU 36 


51.582657 




SER 37 


16.378180 




PRO 38 


33.560043 


45 


VAL 39 


6.448641 




ARG 40 


7.068311 




LYS 41 


15.291286 




ILE 42 


1.612160 




LEU 43 


1.880854 


50 


ARG 44 


16.906845 




ILE 45 


0.000000 




VAL 46 


2.312647 




PHE 47 


2.955627 




HIS 48 


20.392527 


55 


ASP 49 


4.238116 
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ALA 


50 


0.510757 




ILE~ 


"51 


1.576962 




gly" 


"52 


2.858601 




phe" 


"53 


48.633503 


5 


ser" 


"54 


8.973248 




pro" 


"55 


58.822315 




ALA" 


"56 


59.782852 




LEU' 


"57 


46.483955 




thr" 


"58 


86.744827 


10 


ala" 


"59 


89.515816 




ALA~ 


"60 


81.163239 




GLY~ 


"61 


70.119019 




GLN" 


'62 


112.635498 




phe" 


"63 


93.522354 


15 


gly" 


"64 


2.742587 




GLY" 


"65 


13.379636 




GLY" 


"66 


22.722847 




gly" 


"67 


0.000000 




ala" 


"68 


0.268693 


20 


asp" 


'69 


12.074840 




gly" 


"70 


0.700486 




SER~ 


"71 


0. 000000 




ILE" 


"72 


0.000000 




ile" 


"73 


0.000000 


25 


ala" 


"74 


17.304443 




his" 


"75 


41.071186 




ser" 


"76 


20.000793 




asn" 


'77 


120.855316 




ile" 


"78 


66.574982 


30 


glu" 


"79 


2.334954 




leu" 


"80 


41.329689 




ala" 


"81 


77.370575 




phe" 


"82 


38.758774 




pro" 


"83 


131.946289 


35 


ala" 


"84 


34.893864 




asn" 


"85 


5.457000 




gly" 


"86 


43.364151 




gly" 


"87 


51.561348 




leu" 


"88 


0.242063 


40 


thr" 


"89 


73.343575 




asp" 


"90 


130. 139389 




thr" 


"91 


17.863211 




ile" 


"92 


0.268693 




glu" 


"93 


92.210396 


45 


ALA" 


"94 


35. 445068 




leu" 


"95 


1.343467 




arg" 


"96 


31.175611 




ALA" 


"97 


44.650192 




VAL~ 


"98 


17.698566 


50 


GLY" 


"99 


1.471369 




ILE~ 


"100 


62.441463 




asn" 


"101 


107. 139748 




his" 


"102 


46.952496 




gly" 


"103 


46.559296 


55 


val" 


"104 


11.342628 




ser" 


"105 


15.225677 




phe" 


"106 


6.422011 
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78 





GLY 


107 


3.426864 




ASP 


"108 


10.740790 




LEU" 


"109 


0.268693 




ile" 


"110 


1.880854 


5 


gln" 


"ill 


31.867456 




phe" 


"112 


0.000000 




ala" 


"113 


0.000000 




THR 


"114 


3.656114 




ALA 115 


8.299393 


10 


VAL 116 


0.268693 




GLY 


117 


0.268693 




MET 


"118 


3.761708 




SER~ 


"119 


14 .536770 




ASN~ 


"120 


25.928799 


15 


CYS~ 


"121 


0.537387 




PRO" 


"122 


29 .798336 




gly" 


"123 


33 . 080013 




SER 


124 


17 .115562 




pro" 


"125 


36.908714 


20 


ARG 


"126 


108.274727 




LEU" 


"127 


21.238588 




glu" 


"128 


53 .742313 




PHE 


"129 


3.761708 




LEU 


"130 


12 .928699 


25 


THR~ 


"131 


10.414591 




GLY~ 


"132 


47 .266495 




ARG~ 


"133 


12 .247048 




SER 


"134 


63.047237 




ASN 


"135 


31.403708 


30 


SER~ 


"13 6 


97.999619 




SER~ 


"137 


28.505201 




GLN 


"138 


102.845520 




PRO 


"139 


49.691917 




SER~ 


"140 


9. 423104 


35 


pro" 


'141 


25.724171 




pro" 


"142 


80.706665 




SER 


143 


105.318176 




LEU 


"144 


20.154398 




ILE 


"145 


41.288322 


40 


pro" 


"146 


10.462679 




gly" 


"147 


19.803421 




PRO 


"148 


18.130360 




GLY 


"149 


47.391853 




ASN 


"150 


60.248917 


45 


THR" 


"151 


87.887985 




VAL~ 


"152 


13.870322 




THR 


"153 


74.664734 




ALA 


"154 


45. 251106 




ILE 


"155 


2.686934 


50 


LEU 


"156 


28.720940 




ASP" 


"157 


110. 081253 




ARG 


"158 


31. 228874 




MET 


"159 


1.612160 




GLY" 


"160 


38.223858 


55 


ASP" 


"161 


46.293152 




ALA~ 


"162 


9.877204 




GLY" 


"163 


34.267326 
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PHE 


164 


11.057570 




SER~ 


"165 


51.158882 




pro" 


"166 


62.767738 




asp" 


"167 


75. 164917 


5 


GLU~ 


"168 


43.334976 




VAL 


"169 


6.365355 




VAL" 


'170 


2 .955627 




asp" 


"171 


7.004863 




leu" 


"172 


1.880854 


10 


leu" 


"173 


3 . 197691 




ALA" 


"174 


0.000000 




ALA" 


"175 


1.074774 




HIS" 


"176 


0.502189 




ser" 


"177 


0.806080 


15 


leu" 


"178 


3 . 197691 




ALA" 


"179 


3 . 337480 




SER" 


'180 


0.466991 




gln 


"181 


2 . 122917 




glu" 


"182 


40.996552 


20 


gly" 


"183 


62 . 098671 




leu" 


"184 


23.954853 




ASN 


"185 


15.918136 




SER 


"186 


95.185318 




ALA" 


"187 


59.075272 


25 


ILE 


"188 


27.675419 




PHE 


"189 


102.799423 




arg" 


"190 


55. 265549 




ser" 


"191 


6.986028 




pro" 


"192 


2.686934 


30 


leu 


"193 


12.321225 




asp" 


"194 


2.127163 




ser 


"195 


33 .556419 




THR~ 


"196 


33 .049286 




PRO" 


"197 


20.874798 


35 


GLN~ 


"198 


65.729698 




val" 


"199 


31.705818 




phe" 


"200 


4.753195 




ASP - 


"2 01 


13.744506 




thr" 


"202 


1.612160 


40 


gln" 


"203 


16.081930 




phe" 


"204 


2.581340 




tyr"" 


"205 


1.880854 




ile" 


"206 


9.356181 




glu 


"207 


0.735684 


45 


thr" 


"208 


10.685907 




leu" 


"209 


9.672962 




leu" 


"210 


2.955627 




LYS" 


"211 


77.176834 




gly" 


"212 


40.968609 


50 


thr" 


"213 


78.718216 




thr" 


"214 


21.738384 




gln" 


"215 


77.622299 




pro" 


"216 


25.441587 




gly" 


"217 


8.320850 


55 


pro" 


218 


96.972305 




ser" 


"219 


64.627823 




leu" 


"220 


85.732414 
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ZZ X 


27 111 
Z / * JQ1J.11 




pup" 


"222 


1 ~xa 620178 




aT.a" 


Z Z <J 


•> H7T014 




nT.TT - 


~2 2 4 

Z Z 4 


12 1 41763 


D 


(ZT .TT~ 


ZZ3 


fi5 129R6R 




T T?TT~~ 


Z Z o 








~2 2 7 






PTJff 


Z 46 o 


7 017754 




PHE~ 


~229 


0 000000 


10 


PRO~ 


230 


47 . 827423 




GLY~ 


231 


23 . 790522 




GLU~ 


232 


6 . 643466 




PHE~ 


233 


6 .713862 




ARG~ 


234 


18 012030 


1 5 


WFT - 


"2 3 5 


A SQfilRR 




JDfl" 


£ JO 


Q 1 41 5581 




CCD - 




1 q n 0 1 1 1\ 
x • y ozxzd 




R CD - 


O ^ Q 
ZOO 


O • Z4CPO / X 




AT A _ 


z j y 


XZ . 03 / ZOO 


z u 




Z ** U 


/ O • OZUDZO 






Z 4 J. 


O • Z O Z X 






Z 4 Z 






ARG~ 


24 3 


77 207176 




ASP~ 


~244 


36 207306 


25 


ser" 


245 


104 023796 






^ T U 


121 852341 




THR~ 


247 


2 955627 




AT.A - 


2 48 

^ ** O 


4 81 0700 




vlu 


O *t 


4 / • J JXJUu 




Apr:" 




t UU6 / / O 




qipp" 

xx\ir 


~9 m 


9 A 1 ft 9 A 1 




V7J-I1N 


^ i? 


3 * 3 Oft ^ 




SER" 


"253 






met" 


"254 


1 124224 


35 


thr" 


"255 


O OOOOOO 




SER~ 


"256 


53 . 758987 




SER~ 


"257 


37 276134 




asn" 


"258 


44 381340 




GLU~ 


"259 


149 .565140 


40 


VAL~ 


"260 


57 500389 




MET 261 


2 fi79114 




GLY 262 


in 1 7fi1 R2 




GLN 


263 


1 07 4^RQ1 fi 
lv / .4Z70710 




ARG~ 


"264 


00.40ZXOU 


A f\ 


TYR~ 


"265 


U. iS JJ43D 




arg" 


"266 


Q1 17QA1Q 

yx.x/yoxy 




ALA" 


"267 






ALA" 


"268 






MET" 


"269 


17 1 2?ni 1 


50 


ALA" 


"270 


22.455158 




LYS~ 


"271 


73.386177 




MET" 


"272 


3.959508 




SER" 


"273 


15.043281 




VAL" 


"274 


23.887930 


55 


LEU" 


"275 


17.196379 




GLY" 


"276 


44.362202 




phe" 


"277 


68.062485 
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ASP 


278 


94.902039 




ARG 


"279 


113.549011 




ASN 


"280 


134.886017 




ALA~ 


"281 


72.340973 


5 


LEU 


"282 


26.692348 




THR~ 


'283 


27.696728 




ASP 


284 


72.214157 




CYS 


'285 


0.000000 




SER 


"286 


28.209335 


10 


ASP" 


"287 


64.560753 




VAL 


'288 


7.040061 




ILE 


'289 


8.665112 




pro" 


'290 


48.682365 




SER 


"291 


86.141670 


15 


ALA~ 


"292 


29.031240 




val" 


"293 


84.432014 




ser" 


'294 


85.944153 




ASN - 


'295 


49.017288 




asn" 


"296 


133.459198 


20 


ala" 


"297 


57.283794 




ALA~ 


"298 


65.233749 




PRO 


"299 


24.751518 




VAL - 


"300 


45.409184 




ile - 


"3 01 


8.060802 


25 


PRO 


"302 


14.742939 




GLY 


"303 


16.589832 




GLY 


"3 04 


34.238071 




LEU 


"305 


24.719791 




thr" 


"306 


49 .356300 


30 


VAL 


"307 


71.491821 




ASP" 


"308 


130.906174 




ASP 


"309 


31.733070 




ile" 


"310 


19 .581894 




GLU" 


"311 


81.414574 


35 


VAL~ 


"312 


94.769890 




SER 


"313 


39. 688896 




CYS 


"314 


9.998511 




PRO" 


"315 


120.328018 




SER~ 


"316 


95. 364319 


40 


GLU" 


"317 


65.560959 




PRO" 


"318 


100.254364 




phe" 


"319 


46.284115 




pro" 


"320 


31.328060 




GLU" 


"321 


177.602249 


45 


ILE" 


"322 


33.449741 




ala" 


"323 


46.892982 




thr" 


"324 


79.976471 




ala" 


"325 


36.423820 




ser" 


326 


124 . 467422 


50 


gly" 


"327 


28.219524 




pro" 


~328 


107.553696 




leu" 


~329 


86.789825 - 




pro" 


"33 0 


34 .287163 




ser" 


"331 


75.764053 


55 


leu" 


~332 


32 .840569 




ala" 


"333 


61.516434 




pro" 


"334 


82.389992 
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ALA_335 6.246871 
PRO_336 56.750813 
HEM_337 60.435017 
CA_338 2.078997 
5 CA_339 0.000000 
NAG_340 141.534668 
NAG_341 186.311371 
Subset REST: 

restmole. list 
10 Subset REST: 

ARP: 9, 69-70 ,125 ,127,133,299-301,334-33 6 
restatom. list 
Subset REST: 

ARP : SER 9:N,CA,C,0,CB,OG 
15 ARP : GLY 69:N,CA,C,0 

ARP : GLN 70:N,CA,C,O,CB,CG,CD,0El,NE2 
ARP : GLY 125:N,CA / C,0 
ARP : SER 127 :N,CA, 0,0,08,00 
ARP:PRO 133:N,CA,CD,C,0,CB,CG 
20 ARP : SER 299 :N,CA,C,0,CB,0G 

ARP : ALA 300:N,CA,C,O,CB 
ARP : VAL 301:N,CA,C,O,CB,CGl,CG2 
ARP : SER 334 :N,CA,C,0,CB,OG 
ARP : GLY 335:N,CA,C,0 
25 ARP:PR0 336:N,CA,CD,C,0,CB,CG 

Subset SUB5B: 

sub5bmole . list 
Subset SUB5B: 

ARP: 10-11, 34, 38, 65-68, 71-72, 120-121 ,12 3-12 4, 
30 128-132,134,270,274, 

ARP: 297-298, 302-303 ,311-312, 332-333, 337-338 
sub5batom. list 
Subset SUB5B: 



35 



40 



45 



50 



55 



ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 



VAL 10:N,CA,C,0,CB,CG1,CG2 

THR 11:N,CA,C,0,CB,0G1,CG2 

GLN 34 :N, OA, 0,0, CB,CG, CD,0E1,NE2 

TYR 38:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

LEU 65:N,CA,C,0,CB,CG,CD1,CD2 

THR 66:N,CA, 0,0,08,001,062 

ALA 67:N,0A,C,O,CB 

ALA 68:N,CA,C,0,CB 

PHE 71:N,CA,C / 0 / CB,CG,CDl,CD2,CEl,CE2,CZ 
GLY 72 :N, OA, 0,0 

PHE 120:N,CA,C,O,CB,CG,CDl,CD2,CEl,CE2,CZ 

ALA 121:N,CA,C,0,CB 

ALA 123:N,CA,C,0,CB 

VAL 124:N,CA,C,0,CB,CG1,CG2 

ASN 1 2 8 : N , OA ,0,0, CB , CG , OD 1 , ND2 

CYS 129:N,CA,C,0,CB,SG 

PRO 130:N,CA,CD,C,O,CB,CG 

GLY 131:N,CA,C,0 

SER 132:fr,CA,C,0,CB,OG 

ARG 134 :N, OA, 0,0, CB,CG, CD,NE, CZ, NH1,NH2 
GLY 270:N,CA,C,O 

ARG 274:N,CA,C,O,0B,CG,CD,NE,CZ,NHl,NH2 
ILE 297:N,CA,C,0,CB,CG1,CG2,CD1 
PRO 298:N,CA,CD,C,0,CB,CG 
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ARP:SER 302:N,CA,C,O,CB,OG 
ARP:ASN 303 :N, CA, C,0, CB, CG,0D1 ,ND2 
ARP : GLY 311:N,CA,C,0 
ARP:GLY 312:N,CA,C,0 
5 ARP:THR 332 :N,CA,C, 0,CB / 0G1,CG2 

ARP:ALA 333:N,CA,C,0,CB 
ARP: LEU 337 :N, CA, C, O, CB, CG, CD1 , CD2 
ARP: PRO 338:N,CA,CD,C,0,CB,CG 
Subset ACTSITE: 
10 actsitentole. list 
Subset ACTSITE: 

ARP: 44-61, 75-77, 79-80, 87-88, 90-96, 

99,118,122,126,135,148-149,152 
ARP: 163-164 , 167 , 176-194 , 197-205 , 207-20 

213,216,230-231,241, 
ARP: 243-246, 249, 259, 273 ,277,280, 343-34 
actsiteatom. list 



15 



-158, 
9,211- 

7H 



Subset ACTSITE 





ARP: 


GLU 


44 : 


20 


ARP: 


SER 


45 : 




ARP: 


PRO 


46: 




ARP: 


VAL 


47: 




ARP: 


ARG 


48: 




ARP: 


LYS 


49: 


25 


ARP: 


ILE 


50: 




ARP: 


LEU 


51: 




ARP: 


ARG 


52 : 




ARP: 


ILE 


53 : 




ARP: 


VAL 


54: 


30 


ARP: 


PHE 


55: 




ARP. 


'HIS 


56: 




ARP: 


ASP 


57: 




ARP: 


ALA 


58: 




ARP: 


ILE 


59: 


35 


ARP. 


.GLY 


60: 




ARP: 


PHE 


61: 




ARP: 


•GLY 


75: 




ARP ' 


.ALA 


76: 




ARP: 


.ASP 


77: 


40 


ARP: 


:SER 


79: 




ARP 


:ILE 


80: 




ARP: 


:GLU 


87: 




ARP: 


:LEU 


88: 




ARP 


:PHE 


90: 


45 


ARP ' 


:PRO 


91: 




ARP: 


: ALA 


92: 




ARP 


:ASN 


93: 




ARP: 


:GLY 


94: 




ARP- 


:GLY 


95: 


50 


ARP 


:LEU 


96: 




ARP 


:THR 


99: 




ARP 


:ILE 


118 




ARP 


:THR 


122 




ARP 


:MET 


126 


55 


ARP 


:LEU 


135 




ARP 


:SER 


148 




ARP 


:PRO 


149 



N,CA,C,0,CB,CG,CD,OEl,OE2 

N,CA,C,0,CB,OG 

N , CA , CD , C , O , CB , CG 

N,CA,C,0,CB,CG1,CG2 

N , CA , C , O , CB , CG , CD , NE , C Z , NH1 , NH2 

N,CA,C,0,CB,CG,CD,CE,NZ 

N,CA,C f O,CB,CGl,CG2 ,CD1 

N,CA,C,0,CB,CG,CD1,CD2 

N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

N,CA,C,0,CB,CG1,CG2 ,CD1 

N,CA,C,0,CB,CG1,CG2 

N , CA , C , O , CB , CG , CD 1 , CD2 , CE 1 , CE2 , CZ 
N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
N,CA,C,0,CB,CG,0D1,0D2 
N,CA,C,0,CB 

N,CA,C,0,CB,CG1,CG2,CD1 
N,CA,C,0 

N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

N,CA,C,0 

N,CA,C,0,CB 

N,CA,C,0,CB,CG,0D1,0D2 

N,CA,C,0,CB,OG 

N,CA / C,0,CB,CG1,CG2,CD1 

N,CA,C,0,CB,CG,CD,OE1,OE2 

N,CA,C,0,CB,CG,CD1,CD2 

N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

N , CA , CD , C , O , CB , CG 

N,CA,C,0,CB 

N / CA,C,0,CB,CG,0D1,ND2 

N,CA,C,0 

N,CA,C,0 

N / CA / C,0,CB,CG,CD1,CD2 
N,CA,C,0,CB,0G1,CG2 
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ARP: 


LEU 


152 


:N 


CA, 


C 


rO, 


CB , CG , CD1 , CD2 




ARP 


:ILE 


153 


:N 


rCA, 


C 


-o ( 


CB , CGI , CG2 , CD1 




ARP' 


. PRO 


154 


:N, 


CA, 


CD , C , O , CB , CG 




ARP 


: GLY 


155 


:N, 


,CA, 


C 


rO 




5 


ARP 


.PRO 


156 


:N 


CA, 


CD,C,0,CB,CG 




ARP. 


:GLY 


157 


:N, 


CA, 


C 


,o 






ARP. 


:ASN 


158 


:N, 


CA, 


C 


,0, 


CB,CG,0D1,ND2 




ARP: 


ILE 


163 


,N ( 


CA, 


C 


rO, 


CB , CGI , CG2 , CD1 




ARP ' 


. LEU 


164 


:N, 


CA, 


c, 


rO, 


CB/CG ,001,002 


10 


ARP: 


MET 


167 


:N, 


CA, 


C 


-0, 


CB,CG,SD,CE 




ARP: 


GLU 


176 


■N ( 


CA ( 


c, 


o, 


CB,CG,CD,0E1,0E2 




ARP. 


VAL 


177 


:N, 


CA, 


C 


rO, 


CB , CGI , CG2 




ARP: 


VAL 


178' 


N, 


CA, 


c, 




CB,CG1,CG2 




ARP: 


ASP 


179. 


,N, 


CA, 


C 


,0, 


06,00,001,002 


15 


ARP: 


LEU 


180' 


N, 


CA, 


c< 


o, 


CB,CG,CD1,CD2 




ARP: 


LEU 


181. 


•N, 


CA, 


c, 




CB,CG, CD1,CD2 




ARP: 


ALA 


182: 


N, 


CA, 


c< 




CB 




ARP: 


ALA 


183' 


N, 


CA, 


c, 


o, 


CB 




ARP: 


HIS 


184 


N, 


CA, 


c, 


o, 


CB,CG,ND1,CD2,CE1, 


20 


ARP: 


SER 


185: 


N, 


CA, 


c. 


o, 


CB,0G 




ARP: 


LEU 


186 


N, 


CA, 


c, 


o, 


CB,CG,CD1,CD2 




ARP: 


ALA 


187. 


N, 


CA, 


c, 


o, 


CB 




ARP: 


SER 


188. 


N, 


CA, 


c, 


o, 


CB,OG 




ARP* 


GLN 


189' 


N, 


CA, 


c, 


o, 


CB , CG , CD , OEl , NE2 


25 


ARP: 


GLU 


190: 


N, 


CA, 


c, 


o, 


CB,CG,CD,0E1,0E2 




ARP: 


GLY 


191 


N, 


CA, 


c, 


o 


CB , CG , CD1 , CD2 




ARP: 


LEU 


192: 


N, 


CA, 


c, 


o, 




ARP. 


ASN 


193. 


N, 


CA, 


c, 


o, 


CB,CG,ODl,ND2 




ARP: 


SER 


194' 


N, 


CA, 


c< 


o, 


CB,OG 


30 


ARP' 


PHE 


197: 


N, 


CA, 


c. 


o, 


08,00,001,002 , CE1, 




ARP 


ARG 


198' 


N, 


CA, 


c, 


o, 


CB,CG,CD,NE,CZ,NH1 




ARP: 


SER 


199: 


N, 


CA, 


c, 


o, 


CB,OG 




ARP. 


PRO 


200. 


N, 


CA, 


CD,C,0,CB,CG 




ARP 


LEU 


201' 


N, 


CA, 


c, 


o, 


CB,CG,CD1,CD2 


35 


ARP. 


ASP 


202: 


N, 


CA, 


c, 


o, 


CB,CG,ODl,OD2 




ARP 


.SER 


203: 


N, 


CA, 


c, 


o, 


CB,0G 




ARP: 


THR 


204: 


N ( 


CA, 


c, 


o, 


CB,0G1,CG2 




ARP ' 


.PRO 


205 


N, 


CA, 


CD,C,0,CB,CG 




ARP: 


VAL 


207: 


N, 


CA, 


c, 


o, 


CB,CG1,CG2 


40 


ARP 


:PHE 


208. 


•N, 


CA, 


c, 


o, 


CB,CG,CD1,CD2,CE1, 




ARP 


:ASP 


209' 


N, 


CA, 


c, 


o, 


CB,CG,0D1,0D2 




ARP: 


.GLN 


211' 


N, 


CA, 


c, 


Of 


CB,CG,CD,0E1,NE2 




ARP 


:PHE 


212 


:N, 


,CA, 


c, 


-O, 


CB,CG,CD1,CD2,CE1, 




ARP 


:TYR 


213 


N, 


CA, 




o, 


CB,CG,CD1,CD2,CE1, 


45 


ARP 


'THR 


216. 


•N 


CA, 


c 


o, 


CB,0G1,CG2 




ARP 


:PHE 


230 


.N 


CA, 


c 


o, 


CB , CG, CD1 , CD 2 , CE1 , 




ARP 


ALA 


231 


N ( 


CA, 


c, 


o, 


CB 




ARP 


:PHE 


241 


.N 


CA, 


c 


o, 


CB,CG,CD1,CD2,CE1, 




ARP 


:MET 


243 


N 


CA, 


c 


o, 


CB,CG,SD,CE 


50 


ARP 


: ARG 


244 


:N, 


CA, 


c, 


rO, 


CB,CG,CD,NE,CZ,NH1 




ARP 


:SER 


245 


:N 


CA, 


c 


rO, 


CB,0G 




ARP 


:ASP 


246 


:N 


CA, 


c 


0/ 


CB,CG,ODl,OD2 




ARP 


:LEU 


249 


:N 


r CA, 


c 


rO, 


CB,CG,CD1,CD2 




ARP 


:TRP 


259 


:N 


CA, 


c 


o, 


CB,CG,CD1, 


55 






CD2 , 


NE1,CE2 


,CE3,CZ2,CZ3,CH2 




ARP:TYR 


273:N 


,CA, 


c 


,o ( 


CB , CG , CD1 , CD2 , CE1 , 




ARP: MET 


277:N 


rCA, 


c 




CB,CG,SD,CE 



NE2 



. CE2 , CZ 
,NH2 



CE2 , CZ 

,CE2,CZ 
CE2,CZ,0H 

CE2,CZ 

. CE2 , CZ 

,NH2 



CE2,CZ,OH 
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ARP : MET 280:N, CA, C, O, CB, CG, SD, CE 
ARP: ALA 343:N,CA,C,0,CB 
ARP: PRO 344:N,CA,CD,C,O,0XT,CB,CG 
ARP: HEM 3 4 5H : FE , NA , NB , NC , ND , CHA , CHB , 
5 CHC , CHD , CIA, C2 A , C3A, C4 A , CMA , CAA , CBA , CGA 

ARP : HEM 345H:01A,02A,C1B,C2B,C3B,C4B,CMB, 

CAB, CBB, C1C,C2C, C3C, C4C, CMC, CAC, CBC 
ARP: HEM 345H:C1D,C2D,C3D / C4D,CMD,CAD,CBD,CGD,01D,02D 
ARP:CA 346H:CA 
10 ARP:CA 347H:CA 

Subset RESTx: 

restxmole . list 
Subset RESTX 

NEWMODEL: 9,334-336 
1 5 res txa torn .list 
Subset RESTX: 

NEWMODEL: SER 9 :N, CA, C,0, CB, OG 
NEWMODEL : SER 334 :N, CA, C, O, CB, OG 
NEWMODEL : GL Y 335:N,CA,C,0 
20 NEWMODEL : PRO 336 :N, CA,CD, C, O, CB, CG 



Example 5 

Activation of mPEG 15,000 with N-succinimidyl carbonate 

25 mPEG 15,000 was suspended in toluene (4 ml/g of mPEG) 20% was 

distilled off at normal pressure to dry the reactants 
azeotropically. Dichloromethane (dry 1 ml/g mPEG) was added when 
the solution was cooled to 30°C and phosgene in toluene (1.93 M 5 
mole/mole mPEG) was added and mixture stirred at room temperature 

30 over night. The mixture was evaporated to dryness and the desired 
product was obtained as waxy lumps. 

After evaporation dichloromethane and toluene (1:2, dry 3 
ml/g mPEG) was added to re-dissolve the white solid. AT-Hydroxy 
succinimide (2 mole/mole mPEG.) was added as a solid and then 

3 5 triethylamine (1.1 mole/mole mPEG) . The mixture was stirred for 3 
hours, initially unclear, then clear and ending with "a small 
precipitate. The mixture was evaporated to dryness and 
recrystallised from ethyl acetate (10 ml) with warm filtration to 
remove salts and insoluble traces. The blank liquid was left for 

40 slow cooling at ambient temperature for 16 hours and then in the 
refrigerator over night. The white precipitate was filtered and 
washed with a little cold ethyl acetate and dried to yield 98 % 
(w/w) . NMR Indicating 80 - 90% activation and 5 o/oo (w/w) 
HNEt 3 Cl. 1 H-NMR for mPEG 15,000 (CDC1 3 ) d 1.42 t (1= 4.8 CH 3 i 

45 HNEt 3 Cl) , 2.84 s (1= 3.7 succinimide), 3.10 dq (1= 3.4 CH 2 i 
HNEt 3 Cl) , 3.38 S (1= 2.7 CH 3 i OMe) , 3.40* dd (I = 4.5 o/oo, 13 C 
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satellite), 3.64 bs (I = 1364 main peak), 3.89* dd (I = 4.8 o/oo , 
13 C satellite), 4.47 dd (I = 1.8, CH2 in PEG). No change was seen 
after storage in a desiccator at 22 °C for 4 months. 

5 Example 6 

Activation of mPEG 5.000 with N-succinimidvl carbonate 

Activation of mPEG 5,000 with N-succinimidyl carbonate was 
performed as described in Example 5. 

10 EXAMPLE 7 

Construction and expression of PD498 variants ; 

PD498 site-directed variants were constructed using the "maxi- 

oligonucleotide-PCR" method described by Sarkar et al., (1990): 

BioTechniques 8: 404-407. 
15 The template plasmid was shuttle vector pPD498 or an analogue 

of this containing a variant of the PD498 protease gene. 

The following PD498 variants were constructed, expressed and 

purified. 

A: R28K 
20 B: R62K 

C: R169K 

D: R28K + R62K 

E: R28K + R169K 

F: R62K + R169K 
25 G: R28K+R69K+R169K 

Construction of variants 

For introduction of the R28K substitution a synthetic 

oligonucleotide having the sequence: GGG ATG TAA CCA AGG GAA GCA 
30 GCA CTC AAA CG (SEQ ID NO. 7) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 

prepared by Bst E II and Bgl II digestion. Positive variants were 

recognized by Styl digestion and verified by DNA sequencing of the 

total 769 bp insert. 
35 For introduction of the R62K substitution a synthetic 

oligonucleotide having the sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC {SEQ ID NO. 8) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
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prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by Clal digestion and verified by DNA sequencing of the 
total 769 bp insert. 

For introduction of the R169K substitution a synthetic 
5 oligonucleotide having the sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by the absence of a Rsa I restriction site and verified 
10 by DNA sequencing of the total 769 bp insert. 

For simultaneously introduction of the R28K and the R62K 
substitutions, synthetic oligonucleotides having the sequence: 
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) and the 
sequence : 

15 CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and verified 
by DNA sequencing of the total 769 bp insert. 

20 For simultaneously introduction of the R28K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: GGG 
ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 8) and the 
sequence : 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 8) were used 
25 simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert. 

30 For simultaneously introduction of the R62K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: CGA 
CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the sequence: 
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 

35 pPD498" plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Clal digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert 
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For simultaneously introduction of the R28K, the R62K and the 
R169K substitutions, synthetic oligonucleotides having the 
sequence : 

GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID No. 7), the 
5 sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the 
sequence : 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
10 pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and absence of 
a Rsa I site. The variant was verified by DNA sequencing of the 
total 769 bp insert. 

15 Fermentation, expression and p urification of PD498 variants 

Vectors hosting the above mentioned PD498 variants were 
purified from E . coll cultures and transformed into B. subtilis in 
which organism the variants were fermented, expressed and purified 
as described in the "Materials and Methods" section above. 

20 

Example 7 

mn-inaation of triple subs t ituted PD4Q8 variant with activated 
mPEG 5.000 

200 mg of triple substituted PD498 variant (i.e. the 
25 R28K+R62K+R169K substituted variant) was incubated in 50 mm 
NaBorate, pH 10, with 1.8 g of activated mPEG 5,000 with N- 
succinimidyl carbonate (prepared according to Example 2), in a 
final volume of 20 ml. The reaction was carried out at ambient 
temperature using magnetic stirring. Reaction time was 1 hour. The 
30 reaction was stopped by adding DMG buffer to a final concentration 
of 5 mM dimethyl glutarate, 1 mM CaCl 2 and 50 mM borate, pH 5.0. 

The molecule weight of the obtained derivative was approxi- 
mately 120 kDa, corresponding to about 16 moles of mPEG attached 
per mole enzyme. 

35 -Compared to the parent enzyme, residual activity was close to 
100% towards peptide substrate (succinyl-Ala-Ala-Pro-Phe-p- 
Nitroanilide) . 
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Exampl 8 

anpr gpnicitv trails of PD49 « vartant-SPBG5 . 000 in guinea pigs 

Dunkin Hartley guinea pigs are stimulated with 1.0 ug PD498- 
SPEG 5,000 and 1.0 ng modified variant PD498-SPEG 5,000 by 

5 intratracheal installation. 

Sera from immunized Dunkin Hartley guinea pigs are tested 
during the trail period in a specific IgGi ELISA (described above) 
to elucidate whether the molecules could activate the immune 
response system giving rise to a specific IgGi response indicating 
10 an allergenic response. 

The IgGi levels of Dunkin Hartley guinea pigs during the trail 
period of 10 weeks are observed. 

Example 9 ... 

15 suitable substitutions in H nmicola lanuginosa lipase for 

addition of amino at tachment groups ( -NHo ) 

The 3D structure of Humicola lanuginosa lipase (SEQ ID NO 6) 

is available in Brookhaven Databank as ltib.pdb. The lipase 

consists of 2 69 amino acids. 
20 The procedure described in Example 1 was followed. The 

sequence of H. lanuginosa lipase is shown below in the table 

listing solvent accessibility data for H. lanuginosa lipase. 

H. lanuginosa residue numbering is used (1-269), and the active 

site residues (functional site) are S146, S201 and H258. The 
25 synonym TIB is used for ff. lanuginosa lipase. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeKzone.bcl and makeKzone2 .bcl below: 

Conservative substitutions : 

3 0 makeKzone.bcl 

1 Delete Subset * . . 

2 Color Molecule Atoms * Specified Specification 255,0,255 

3 Zone Subset LYS :lys:NZ Static monomer /residue 10 
Color Subset 255,255,0 

35 4 Zone Subset NTERM :1:N Static monomer/residue 10 
Color Subset 255,255,0 

5 #N0TE : editnextline ACTSITE residues according to the 

protein . , 

6 Zone Subset ACTSITE : 146, 201, 258 Static monomer/residue 8 

40 Color Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object name according to the protein 
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10 Combine Subset REST Difference TIB ALLZONE 

11 List Subset REST Atom Output File restatom. list 

12 List Subset REST monomer/resTdue Output_File restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 14 List Subset ACTSITE Atom Output File actsiteatom. list 

15 List Subset ACTSITE monomer/ residue Output_File 
actsitemole. list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
10 Color_Subset 

18 Combine Subset SUB 5 A Difference REST5A ACTSITE 

19 Combine Subset SUB5B Difference SUB 5 A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

15 21 List Subset SUB5B Atom Output File sub5batom. list 

22 List Subset SUB5B monomer /residue Output_File subSbmole. list 

23 #Now identify sites for lys->arg substitutions and continue 
with makezone2 . bcl 

24 #Use grep command to identify ARG in restatom. list , 
20 sub5batom. list & aces iteatom. list 

Comments: 

In this case of H . lanuginosa (=TIB) , REST contains the 
Arginines Argl33, Argl39, Argl60, Argl79 and Arg 209, and SUB5B 
25 contains Argll8 and R125. 

These residues are all solvent exposed. The substitutions 
R133K, R139K, R160K, R179K, R209K, R118K and R125K are 
identified in TIB as sites for mutagenesis within the scope of 
this invention. The residues are substituted below in section 
30 2, and further analysis done. The subset ACTSITE contains no 
lysines. 



Non-conservative substitutions: 
makeKzone2 .bcl 

35 1 #sourcefile makezone2 .bcl Claus von der Osten 961128 

2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #N0TE: editnextline object name according to protein 
40 5 Copy Object -To_Clipboard -Displace TIB newmodel 

6 Biopolymer 

7 #N0TE: editnextline object name according to protein 

8 Blank Object On TIB 

9 #NOTE: editnextlines with lys->arg positions 
45-10 Replace Residue newmodel: 118 lys L 

11 Replace Residue newmodel: 125 lys L 

12 Replace Residue newmodel: 133 lys L 

13 Replace Residue newmodel: 139 lys L 

14 Replace Residue newmodel: 160 lys L 
50 15 Replace Residue newmodel: 179 lys L 

16 Replace Residue newmodel: 209 lys L 
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17 # 

18 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

19 Color Molecule Atoms newmodel Specified Specification 
5 255,0,255 

20 Zone Subset LYSx newmodel: lys:NZ Static monomer/ residue 10 
Color_Subset 255,255,0 

21 Zone Subset NTERMx newmodel: 1:N Static monomer/ residue 10 
Color_Subset 255,255,0 

10 22 /NOTE: editnextline ACTSITEx residues according to the 
protein 

23 Zone Subset ACTSITEx newmodel: 146, 201, 258 Static 
monomer/ residue 8 Color_Subset 255,255,0 

24 Combine Subset ALLZONEx Union LYSx NTERMx 

15 25 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

26 Combine Subset RESTx Difference newmodel ALLZONEx 

27 List Subset RESTx Atom Output_File restxa torn. list 

28 List Subset RESTx monomer/residue Output_File 
restxmole. list 

20 29 # 

30 Color Molecule Atoms ACTSITEx Specified Specification 
255,0,0 

31 List Subset ACTSITEx Atom Output_File actsitexatom. list 

32 List Subset ACTSITEx monomer/residue Output_File 
25 actsitexmole. list 

33 # 

34 #read restxatom. list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

3 0 Comments : 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 18,31-33,36,38,40,48,50,56-62,64,78,88,91-93,104- 
106,120,136,225,227-229,250,262,268. Of these three are 
Cysteines involved in disulfide bridge formation, and 
35 consequently for structural reasons excluded from the residues 
to be mutated. The following mutations are proposed in H. 
lemuginosa lipase (TIB) : 

A18K,G31K,T32K,N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K, 
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K, 
40 V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 
Relevant data for Example 2: 

# TIBN0H20 

# residue area 
GLU_1 110.792610 

45 VAL_2 18.002457 

SER_3 53.019516 

GLN_4 85.770164 

ASP_5 107.565826 

LEU_6 33.022659 

50 PHE_7 34.392754 

ASN 8 84.855331 
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GLN 


9 


39. 175591 




PHE~ 


"10 


2 . 149547 




ASN 


"ll 


40.544380 




LEU 


"12 


27. 648788 


5 


PHE~ 


"13 


2 .418241 




ALA 14 


4 . 625293 




GLN 


15 


28. 202387 




TYR 


16 


0 . 969180 




SER~ 


17 


0.000000 


10 


ALA~ 


18 


7. 008336 




ALA~ 


19 


0 . 000000 




ALA - 


20 


0 . 000000 




TYR~ 


21 


6. 947358 




CYS - 


22 


8. 060802 


15 


GLY - 


23 


32.147034 




LYS" 


24 


168. 890747 




ASN~ 


"25 


8 . 014721 




ASN~ 


"26 


11.8 15564 




ASP 


"27 


92 .263428 


20 


ALA 


28 


18 .206699 




pro" 


29 


83 . 188431 

\J ~J • all. \J \J *J J*. 




ala" 


30 


69 . 428421 




GLY - 


31 


50 . 693439 




THR - 


32 


52 . 171135 


25 


ASN~ 


"33 


111.230743 




ile" 


"34 


2.801945 




THR 


"35 


82 . 130569 




CYS~ 


"36 


17.269245 




THR" 


"37 


96.731941 


30 


GLY 


"38 


77 .870995 




asn" 


"39 


123 . 051003 




ALA~ 


"40 


27 .985256 




CYS" 


41 


0 . 752820 




PRO - 


"42 


46 . 258949 


35 


GLU" 


"43 


69 . 773987 




VAL~ 


44 


0.735684 




GLU~ 


"45 


77.169510 




LYS~ 


"46 


141.213562 




ALA~ 


"47 


10.249716 


40 


ASP~ 


"48 


109. 913902 




ala" 


"49 


2 . 602721 




THR." 


50 


32 .012184 




PHE~ 


51 


8 . 255627 




LElf 


"52 


60 . 093613 


45 


TYR" 


"53 


77 . 877937 




ser" 


"54 


26 . 980494 




phe" 


- 55 


10.747735 




GLU - 


56 


112 . 689758 




ASP" 


"57 


92 064278 


50 


SER" 


"58 


32.990780 




gly" 


"59 


53.371807 




val" 


"60 


83.563644 




gly" 


~61 


69.625633 




ASP" 


"62 


75.520988 


55 


VAL" 


"63 


4.030401 




thr" 


"64 


8.652839 




gly" 


- 65 


0.000000 
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PHE 


66 


0.268693 




LEU" 


"67 


11.822510 




ALA" 


"68 


0.537387 




LEU" 


"69 


30.243870 


5 


asp" 


"70 


0.000000 




asn" 


"71 


84.101044 




thr" 


"72 


89.271126 




asn" 


"73 


70.742401 




LYS" 


"74 


98 .319168 


10 


LEU" 


"75 


8.329495 




ile" 


"76 


5.197878 




VAL 77 


0.806080 




LEU 78 


5.293978 




SER 


79 


0.000000 


15 


PHE" 


"80 


2.079151 




arg" 


"81 


41.085312 




gly" 


"8 2 


1.471369 




SER 


"83 


43 .794014 




ARG 


"84 


100.261627 


20 


SER" 


"85 


70.607552 




ILE 


"8 6 


59 . 696865 




GLU 


"87 


136. 510773 




ASN~ 


"88 


119.376373 




TRP 


"89 


102.851227 


25 


ILE~ 


"9 0 


78.068588 




GLY~ 


"91 


60.783607 




ASN 


"92 


45.769428 




LEU~ 


"93 


134.228363 




ASN 


"94 


101. 810959 


30 


PHE 


"95 


41.212212 




ASP~ 


"96 


79.645950 




LEU~ 


"97 


25.281572 




LYS 


"98 


88.840263 




GLU~ 


"99 


132.377090 


35 


ILE~ 


JLOO 


9.135575 




ASN 


101 


63.444527 




ASP~ 


"102 


88.652847 




ile" 


"103 


33.470661 




CYS~ 


"104 


11.553816 


40 


SER" 


"105 


99.461174 




GLY" 


"106 


40.325161 




CYS 


"107 


4.433561 




ARG~ 


"108 


97.450104 




GLY~ 


"109 


1.343467 


45 


HIS 


"110 


4.652464 




ASP" 


"ill 


37.023655 




GLY 


"112 


29.930408 




PHE 


"113 


14.976435 




THR~ 


"114 


10.430954 


50 


SER~ 


"115 


40.606895 




SER" 


'116 


13.462922 




TRP" 


"117 


10.747735 




ARG 


"118 


114.364281 




SER" 


"119 


46.880249 


55 


VAL 


120 


13.434669 




ALA 


"121 


18.258261 




ASP" 


"122 


110.753098 
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94 





THR_ 


1 *i "5 

123 


/fft zTyl 1 QOO 

by • o4i?^^ 






124 


1*7 no m Q / 

i / - uyu / o«* 






12 D 






GJjN_ 


12 6 


iai lotion 


5 


LiYS 


12 / 


o4 • 4DU^41 




VAL 


128 


6 • 448641 




GLU_ 


129 


47 . 700993 




ASP 


130 


75 . 529091 




ALA_ 


13 1 


11 . 340775 


1 a 
10 


VAL 


132 






ARG_ 


13 3 


ID J . 1 Jo4yo 




tjJ-lU 


1 J 4 


1 1 *> "1/1 nco^ 




tit r»~" 
nio 


'lie 
1JD 


R/l K /l AC 




irJKvJ 




Q *7 T D CQC'l 
?/ iJODjOJ 




A CD - 

AoJr 


ij / 


• D3J13J. 




TiK 


1 JO 






ARG 


139 


/4 . J2124 J 




TT A T 

VALi_ 


"l A A 

140 


"1A 

10 . 1 / J222 




T771 T 

VALj 


14 1 


0 . 2 J J4aD 


20 


PHE_ 


14 2 


3 . 224321 




THR_ 


14 3 


0 . 000000 




GLY 


144 


0 . 000000 




HIS 


145 


4 . 514527 




SER_ 


14 6 


i c n ji Anon 

15 • 749787 


25 


LEU 


1 A *7 

147 


40 . 709171 




/IT « 

GLiY 


I/O 

148 


U • 000000 




GLY 


149 


f\ A /\ /-v O A A 

0 • 000000 




ALA 


1DU 


0 • oo /Jo/ 




T T?T1 
J-lCiU 


151 




o a 


TVT A 

ALA 


1D2 


n o a o £ o, *i 




lriK 


ID J 


-Lo • u / o / y o 




VAL 


"i e« 
1D4 






A T A~" 

ALA 


"ice 
133 


U • UUUUUU 




it v 


"ice 
ISO 


n nnnnnn 

V ♦ UUUUUU 


Jj 


AT A~" 


ID / 


ID • 14UZ J U 






"icq 
IDo 


41. o4t>4 / / 




T T7>TT 

LEU 


159 


£. 1 A A *? c n 

6 • 144750 




ARG 


1 £Z A 

160 


41 . 939716 




GLY 


161 


bo • 9 /oloU 


A A 

41) 


Ti O VT 


lc> 2 


bo • 24joUd 




GLY_ 


ICO 

163 


/9 • lolz /4 




mVD 
1 XK 


164 


•)£ 1 oAovn 
O O • l7VJz4 / 




Abr 


"i cc 
1D3 


1U J • UbozoJ 






16 6 


U . UUUUUU 


vl K 
4 O 


Ti C T3 _ 


xo / 


Z4 • J ZD44J 




T77i T 

VAL 


ICQ 

loo 


A OQOAQ4 

4 • z9y oy 4 






"icq 

loy 


0 • 4 66991 




SER 


1/0 


J . 339332 




TYR 


171 


0 . 000000 


50 


GLY _ 


~172 


0. 000000 




ALA" 


"173 


12 . 674671 




PRO 


~174 


13.117888 




ARG" 


~175 


10.004488 




VAL" 


"17 6 


21.422220 


55 


GLY" 


~177 


2. 680759 




ASN" 


~178 


21.018063 




arg" 


~179 


110.282166 
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ALA_180 
PHE_181 
ALA_182 
GLU_183 
5 PHE_184 
LEU_185 
THR_186 
VAL_187 
GLN_188 

10 THR_189 
GLY_190 
GLY_191 
THR_192 
LEU_193 

15 TYR_194 
ARG_195 
ILE_196 
THR_197 
HIS_198 

20 THR_199 
ASN_200 
ASP_201 
ILE_202 
VAL_203 

25 PRO_204 
ARG_205 
LEU_206 
PRO_207 
PRO_208 

3 0 ARG_209 
GLU_210 
PHE_211 
GLY_212 
TYR_213 

3 5 SER_214 

HIS_215 
SER_216 
SER_217 
PRO_218 

4 0 GLU_219 

TYR_220 
TRP_221 
ILE_222 
LYS_223 

4 5 SER_224 
GLY_225 
THR_226 
LEU_227 
VAL_228 

50 PRO_229 
VAL_23 0 
THR_231 
ARG_232 
ASN_233 

55 ASP_234 
ILE_23 5 
VAL 23 6 



33.210381 

4.567788 

3.897251 

76.354004 

71.225983 

24.985012 

47.023815 

98.244606 

54.152954 

88.660645 

24.792120 

10.726818 

45.458744 

16.633211 

34.829491 

29.030851 

1.973557 

3.493014 

1. 532270 

34.785877 

39.789238 

0. 000000 

31. 168434 

29.521076 

3.515322 

44.882454 

51.051746 

12.575329 

43.259636 

113.700233 

154.628540 

112.505188 

30.084938 

3.268936 

12.471436 

23.354481 

16.406200 

14.665598 

17.240993 

13.145291 

18.718306 

39.229233 

5.105175 

120.739983 

15.407301 

29.306646 

66.806862 

122.682808 

60.923004 

104.620377 

23.398251 

63.372971 

80.357857 

89. 255066 

43.011250 

2.114349 

45. 140491 
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LYS 


237 


105 . 651306 




ile" 


"238 


24 . 671705 




GLU~ 


"239 


"1 «1 y~ ft ft «• ^\ ft 

116 . 891907 




GLY~ 


"240 


31. 965794 


5 


ile" 


"241 


£~ ft *T O ft ft, ft 

4o • 278 099 




asp" 


"242 


28 . 963 699 




ALA" 


"243 


25 • 158146 




THR~ 


"244 


ft. O 1 C1 il Jl A 

98 • J 51440 




GLY 245 


4 J . 842186 


1 ft 

10 


GLY 246 


ft *7 ft ft A O ^ 

O • / 00486 






<C«t / 


*5 ft O ft *7 Jl 

3 . 926274 




Aon 




rr i ft >l "7 0 ft ft 

51 • 047890 




VjJ-ilN 




c ^ n n 1 ft o 

66 • 699188 




nun" 


"Ten 


132 • 4 14047 


15 




2DX 


*T ft ft ** 1 *"» ft 

70 . 213730 




ILE~ 


"252 


^ A "1 j /-» o ft ^ ft 

141. 498062 




PRO" 


"253 


59 . 089233 




ASP" 


"254 


r~ ft. ft *t ft A ^"v r— 

59 . 010895 




ile" 


"255 


63 . 298943 


20 


PRO 


"256 


78 . 608688 




ALA 


"257 


/■> ft ft ^ ft ft ft 

0 . 806080 




HIS 


"258 


3 . 761708 




LEU 


"259 


DO . / 4 /ODD 




TRP~ 


"260 


3D . 229 /10 


25 


TYR 


"261 


5 . 440791 




PHE 


"262 


36.457939 




GLY 


"263 


22.071375 




LEU 


"264 


109. 148178 




ile" 


"265 


2.418241 


30 


GLY" 


"266 


17.730062 




THR" 


"267 


68.217873 




CYS 


"268 


15.418195 




LEU~ 


"269 


165.990997 



Subset REST: 
35 restmole. list 
Subset REST: 

TIB : 5 , 8-9 , 13-14 , 16 , 18-2 0 , 3 1-34 ,36,38,40,48-50,56- 
66, 68,76-79,88,91-93, 

TIB: 100-107, 116-117, 119-121, 132-134, 136, 139-142, 154- 
40 169,177-185, 

TIB: 187, 189-191, 207-212, 214-216, 225, 227-229, 241- 

244,250,262,268 
restatom. list 
Subset REST: 
45 TIB:ASP 5 :N, CA, C , 0 , CB , CG , 0D1 , 0D2 

TIB: ASN 8 : N , CA , C , 0 , CB , CG , OD1 , ND2 

TIB: GLN 9 :N, CA, C, O, CB, CG, CD, OEl,NE2 

TIB : PHE 13 : N , CA , C , 0 , CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ 

TIB: ALA 14 :N, CA, C,0, CB 
50 TIB: TYR 16 :N, CA, C,0, CB, CG, CD1, CD2 ,CE1, CE2 ,CZ ,OH 

TIB: ALA 18:N,CA,C,0,CB 

TIB: ALA 19 : N, CA, C,0, CB 

TIB: ALA 20:N,CA,C,0,CB 

TIB : GLY 31:N,CA,C,0 
55 TIB: THR 32 :N, CA, C,0, CB, 0G1, CG2 

TIB: ASN 33:N,CA,C,0,CB,CG,ODl,ND2 

TIB: ILE 34:N,CA,C,0,CB,CG1,CG2,CD1 



- WO 98/35026 



97 



PCT/DK98/00046 





TIB: 


CYS 


36: 


N . CA . C , 


0 , CB , SG 




TTR' 


GLY 


38 : 


N - CA . C . 


0 




TTR ' 

X u ■ 


ALA 


40: 


N , CA , C , 


0, CB 




TTR ' 

_L J- U < 


ASP 


48 ; 


• N CA . C . 


O . CB . CG . OD1 , OD2 




X J- u « 


ALA 


49 : 


: N . CA . C . 


0,CB 




TIB: 


THR 


50 : 


:N, CA, C, 


0,CB,OGl,CG2 




TIB: 


GLU 


56 . 


•N, CA, C, 


O, CB, CG, CD,OEl,OE2 




TIB: 


ASP 


57 : 


: N , CA , C , 


O , CB , CG , OD1 , OD2 




TIB: 


SER 


58' 


:N f CA,Cj 


O , CB , OG 


10 


TIB: 


GLY 


59. 


;N, CA, C, 


o 




TIB: 


VAL 


60: 


:N,CA,C, 


0,CB,CG1,CG2 




TIB: 


GLY 


61 


: N , CA , C , 


o 




TIB: 


•ASP 


62 


:N, CA, C, 


O, CB , CG , OD1 , OD2 




TIB: 


VAL 


63 


: N CA - C 


O , CB , CGI , CG2 




TIB 


:THR 


64 


■ M . CA - C 


0,CB,0G1,CG2 




X ID « 


■ cn.v 










TTR ' 
x xd « 


PHF 


\j \j 


■ N CA C 


0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 




TIB 


: ALA 


68 


: N , CA , C , 


O, CB 




TIB, 


:ILE 


76 


:N,CA,C, 


0,CB,CG1,CG2,CD1 


20 


TIB 


:VAL 


77 


:N,CA,C, 


0,CB,CG1,CG2 




TIB 


:LEU 


78 


:N,CA,C ( 


0,CB,CG,CD1,CD2 




TIB 


:SER 


79 


:N,CA,C, 


0,CB,OG 




TIB 


:ASN 


88 


:N,CA,C, 


r O,CB,CG,ODl,ND2 




TIB 


: GLY 


91 


:N,CA,C 


r O 


25 


TIB 


:ASN 


92 


:N,CA,C, 


O, CB, CG, ODl,ND2 




TIB 


:LEU 


93 


:N,CA,C, 


r O,CB,CG,CDl,CD2 





TIB 


:ILE 


100: 


N,CA, 


c, 


o ( 


CB,CG1,CG2,CD1 




TIB 


:ASN 


101: 


N,CA, 




o, 


CB, CG, 0D1 ,ND2 




TIB 


:ASP 


102: 


N,CA, 


c, 


o, 


CB,CG,ODl,OD2 


30 


TIB 


:ILE 


103 : 


N,CA, 


c, 


o, 


CB,CG1,CG2,CD1 




TIB 


:CYS 


104: 


N,CA, 


c, 


o, 


CB,SG 




TIB 


:SER 


105: 


N,CA, 


c, 


o, 


CB,0G 




TIB 


:GLY 


106: 


N,CA ( 


c, 


0 






TIB 


:CYS 


107:N,CA, 


c, 


o, 


CB,SG 


35 


TIB 


:SER 


116:N,CA, 


c, 


o, 


CB,OG 




TIB 


:TRP 


117:N,CA, 


c, 


o, 


CB,CG,CD1,CD2,NE1 / CE2, 




CE3,CZ2 


,CZ3 


,CH2 










TIB 


:SER 


119: 


N, CA, 


c, 


o, 


CB,0G 




TIB: VAL 


120: 


N,CA, 




o, 


CB,CG1,CG2 


40 


TIB 


: ALA 


121: 


N,CA, 


c, 


o, 


CB 




TIB: VAL 


132: 


N,CA, 


c, 


o, 


CB,CG1,CG2 




TIB 


: ARG 


133: 


N,CA, 


c, 


o\ 


r CB,CG,CD,NE,CZ,NHl,NH2 




TIB 


:GLU 


134: 


N,CA ( 


c, 


o, 


CB,CG,CD,OEl,OE2 




TIB 


:PRO 


136: 


N,CA, 


CD,C,0,CB,CG 


45 


TIB 


: ARG 


139 


.N,CA, 


C 


o 


r CB,CG,CD,NE,CZ,NHl,NH2 




TIB 


:VAL 


140 


>N,CA, 


C 


o 


r CB,CGl,CG2 




TIB 


:VAL 


141 


N,CA, 


C 


,o 


,CB,CG1,CG2 




TIB 


:PHE 


142 


:N,CA, 


C 




r CB,CG,CDl,CD2,CEl,CE2,CZ 




TIB 


:VAL 


154 


:N,CA, 


C 


o 


r CB,CGl,CG2 


50 


TIB 


: ALA 


155 


N,CA, 


rC 


r o 


r CB 




TIB: GLY 


156 


:N,CA 


r c 








TIB 


: ALA 


157 


:N,CA, 


c 


o 


,CB 




TIB 


: ASP 


158 


:N,CA 


r c 


r o 


,CB,CG,ODl,OD2 




TIB 


:LEU 


159 


:N,CA 


r c 


r o 


r CB,CG,CDl,CD2 


55 


TIB 


: ARG 


160 


:N,CA 


,c 


r o 


r CB,CG,CD,NE,CZ,NHl,NH2 




TIB 


: GLY 


161 


:N,CA 


r c 


r o 






TIB 


:ASN 


162 


:N,CA 




,o 


,CB,CG,ODl,ND2 
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TIB 


:GLY 


163 


:N 


CA, 


C,0 






TIB 


:TYR 


164 


:N 


CA, 


C,0 f CB, CG,CD1,CD2,CE1, 


CE2 , CZ,0H 




TIB 


:ASP 


165 


:N 


CA, 


C,0,CB,CG,0D1,0D2 






TIB 


. ILE 


166 


:N ( 


CA, 


C , 0 , CB , CGI , CG2 , CD1 




5 


TIB 


:ASP 


167 


:N ( 


CA, 


C,0,CB,CG,0D1,0D2 






TIB 


■VAL 


168 


:N, 


CA, 


C,0,CB,CG1,CG2 






TIB 


:PHE 


169 


:N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2 , CZ 




TIB 


:GLY 


177 


:N, 


,CA, 


CO 






TIB 


:ASN 


178 


:N, 


CA, 


C,0,CB,CG,0D1,ND2 




10 


TIB 


. ARG 


179 


:N, 


CA, 


C , 0 , CB , CG , CD , NE , CZ , NH1 


,NH2 




TIB 


.ALA 


180 


:N 


CA, 


C,O f CB 






TIB 


.PHE 


181 


:N, 


CA, 


C , 0 , CB , CG , CD1 , CD2 , CE1 , 


CE2 , CZ 




TIB 


ALA 


182 


:N, 


CA, 


C,0,CB 






TIB 


GLU 


183 


.N, 


CA, 


C,0,CB,CG,CD,OEl,OE2 




15 


TIB 


PHE 


184 


:N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2 , CZ 




TIB 


LEU 


185' 


■N, 


CA, 


C, 0,CB,CG,CD1,CD2 






TIB. 


"VAL 


187. 


•N, 


CA, 


C,0,CB,CG1,CG2 






TIB. 


THR 


189. 


•N, 


CA, 


C,0,CB,0G1,CG2 






TIB 


GLY 


190 


N, 


CA, 


C,0 




20 


TIB 


GLY 


191 


•N, 


CA, 


C,0 






TIB 


PRO 


207: 


N, 


CA, 


CD,C,0,CB,CG 






TIB- 


PRO 


208: 


N, 


CA, 


CD,C,0,CB,CG 






TIB 


ARG 


209' 


N, 


CA, 


C , 0 , CB , CG , CD , NE , CZ , NH1 


,NH2 




TIB 


GLU 


210: 


N, 


CA, 


C,0,CB,CG,CD, OE1,OE2 




25 


TIB: 


PHE 


211: 


N, 


CA, 


C , 0 , CB , CG , CD1 , CD2 , CE1 , 


CE2 , CZ 




TIB: 


GLY 


212: 


N, 


CA, 


C,0 






TIB. 


SER 


214. 


N, 


CA, 


C,0 f CB,OG 






TIB: 


HIS 


215. 


N, 


CA, 


C,0,CB,CG,ND1,CD2,CE1, 


NE2 




TIB. 


SER 


216. 


N, 


CA, 


C,O,CB,0G 




30 


TIB' 


GLY 


225' 


N, 


CA, 


C,0 






TIB 


LEU 


227 


N, 


CA, 


C,0,CB, CG,CD1,CD2 






TIB 


VAL 


228 


N, 


CA, 


C,0,CB, CG1,CG2 






TIB' 


PRO 


229 


N, 


CA, 


CD , C , 0 , CB , CG 






TIB' 


ILE 


241' 


Nj 


CA, 


C,0,CB,CG1,CG2,CD1 




35 


TIB' 


ASP 


242 


N, 


CA, 


C,0,CB,CG,ODl,OD2 






TIB: 


ALA 


243 


N, 


CA, 


C,0,CB 






TIB' 


THR 


244 


N, 


CA, 


C,0,CB,OGl,CG2 






TIB: 


PRO 


250 


N, 


CA, 


CD, C, O f CB,CG 






TIB. 


•PHE 


262. 


•N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1 / 


CE2,CZ 


40 


TIB. 


■CYS 


268. 


'N ( 


CA, 


0,0,06,80 






Subset SUB5B: 












sub5raole. list 









Subset SUB5B: 

TIB: 3-4, 6-7, 10-12, 15, 22-23, 25-30, 35, 37, 39 ,41-42, 44-47, 51- 
45 55,67,69-70, 

TIB: 72 ,74-75, 94-99, 108-112, 114-115, 118, 122-126, 128- 
131,135,137-138, 

TIB: 186, 188, 192-195, 213, 217-219, 223-224, 230-23 1,234-235, 238- 
240, 

50 TIB:245,269 

sub5batom. list 
Subset SUB5B: 

TIB: SER 3 :N,CA,C,O,CB,0G 
TIB:GLN 4 :N,CA,C,0,CB / CG,CD,0E1,NE2 
55 TIB: LEU 6:N,CA,C,0,CB,CG,CD1,CD2 

TIB: PHE 7:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
TIB : PHE 10 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
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TIB: 


ASN 


11: 




TIB: 


LEU 


12 : 




TIB: 


GLN 


15 : 




TIB: 


CYS 


22 : 


5 


TIB: 


GLY 


23 : 




TIB: 


ASN 


25: 




TIB: 


ASN 


26: 




TIB: 


ASP 


27: 




TIB: 


ALA 


28 : 


10 


TIB: 


PRO 


29 : 




TIB: 


ALA 


30: 




TIB: 


THR 


35: 




TIB: 


THR 


37 : 




TIB: 


ASN 


39 : 


15 


TIB: 


CYS 


41: 




TIB: 


PRO 


42: 




TIB: 


VAL 


44: 




TIB: 


GLU 


45: 




TIB: 


LYS 


46: 


20 


TIB: 


ALA 


47: 




TIB: 


PHE 


51: 




TIB: 


LEU 


52: 




TIB: 


TYR 


53 : 




TIB: 


SER 


54: 


25 


TIB: 


PHE 


55 : 




TIB: 


LEU 


67: 




TIB: 


LEU 


69 : 




TIB: 


ASP 


70: 




TIB: 


THR 


72 : 


30 


TIB: 


LYS 


74 : 




TIB: 


LEU 


75: 




TIB: 


ASN 


94: 




TIB: 


PHE 


95 : 




TIB: 


ASP 


96: 


35 


TIB: 


LEU 


97: 




TIB: 


LYS 


98: 




TIB: 


GLU 


99 : 




TIB: 


ARG 


108 




TIB' 


:GLY 


109 


40 


TIB: 


HIS 


110 




TIB: 


•ASP 


111 




TIB 


.GLY 


112 




TIB: 


THR 


114 




TIB 


: SER 


115 


45 


TIB: 


ARG 


118 




TIB 


:ASP 


122 




TIB 


:THR 


123 




TIB 


:LEU 


124 




TIB 


: ARG 


125 


50 


TIB 


:GLN 


126 




TIB 


:VAL 


128 




TIB 


:GLU 


129 




TIB 


:ASP 


130 




TIB 


: ALA 


131 


55 


TIB 


:HIS 


135 




TIB 


:ASP 


137 




TIB 


:TYR 


138 



N, 


CA, 


c 


f\ 

' ^ 1 


r 1 








XT 


CA t 


c 




CB, 








XT 

N, 


? CA, 


c 




rCB, 


Cw , CU , UCi J. / niliZ 






N ( 


CA, 


c 




CB, 


SG 






N( 


, CA, 


c 


rO 










Nj 


CA, 




0, 


,CB, 


CG,ODl,NU<2 






N, 


CA, 


c, 


0, 


CB, 


CG , 0D1 , ND2 






N, 


CA, 


c 




CB, 


CG , OD1 , OU2 






N, 


CA, 


c. 




CB 








N, 


CA, 


CD,C,0, 


Co , CG 






XT 


CA, 


c 


0, 


CB 








XT 

N, 


CA, 


c, 


-0, 


,CB, 


r\r* -1 r*r* 0 
UG1 , CG<2 






XT 

N , 


CA, 


c, 


0, 


CB, 








XT 
N, 


CA, 


c, 


O, 


CB, 


CG f UUJ. , NUZ 






N, 


CA, 


c, 


rO, 


rCB, 








N, 


CA, 


CD,C,0, 


CB , CG 






N, 


CA, 


c, 


Oi 


CB, 


CGI, CG2 






N, 


CA, 


c 


Oi 


CB, 


CG , CD , OE1 , 0E2 






N, 


CA, 


c, 


0, 


CB, 


CG,CD,CE,NZ 






N, 


CA, 


c 


0, 


CB 








N, 


CA, 


c, 


0, 


CB, 


CG , CD1 , CD2 , CE1 , 


CE2 


f cz 


Nj 


CA, 


c 


0, 


CB, 


CG , CD1 r CD2 






N| 


CA, 


c, 


0, 


CB, 


CG , CD1 f CD2 , CE1 , 






N, 


CA, 


c, 


0, 


CB, 


OG 






XT 
N j 


CA, 


c, 


0, 


CB, 


C(j , tUl , \~\J£. f C£iX / 








CA, 


c, 


0, 


CB, 








XT 

N , 


r CA, 


c, 


0, 


CB, 


CV3 , CiJi. , CUZ 






N, 


CA, 


c, 


0, 


CB, 


CG , OL)l f OUz 






XT 

ri , 


CA, 


c, 


0, 


CB, 








N, 


CA, 


c, 


0, 


CB, 


CG,CD,CE,NZ 






N, 


CA, 


c, 


0, 


CB, 


CG , CD1 f CD2 






N, 


CA, 


c, 


0, 


CB, 


CG, 0D1,ND2 






N, 


CA, 


c, 


0, 


CB, 


CG,CD1,CD2,CE1, 


CE2 


,cz 


N, 


,CA, 


c, 


0, 


CB, 


CG,ODl,OD2 






N, 


CA, 


c, 


,0, 


CB, 


CG,CD1,CD2 






N, 


CA, 


c, 


o, 


CB, 


CG,CD,CE,NZ 






N, 


rCA, 


c, 


0, 


CB, 


CG,CD,0E1,0E2 







N,CA,C, 


0,CB,CG,CD,NE,CZ,NH1 


,NH2 


N,CA,C, 


0 




N,CA,C, 


0,CB,CG,ND1 / CD2,CE1, 


NE2 


N,CA,C, 


0,CB,CG,ODl,OD2 




N,CA,C, 


O 




N,CA,C, 


0,CB,0G1,CG2 




N,CA,C, 


0,CB,0G 




N,CA,C, 


0,CB,CG,CD,NE,CZ,NH1 


,NH2 


N,CA,C, 


0,CB,CG,0D1,0D2 




N,CA,C, 


0,CB,0G1,CG2 




N,CA,C, 


0,CB,CG,CD1,CD2 




N,CA,C ( 


0,CB,CG,CD,NE,CZ,NH1 


,NH2 


N,CA,C, 


0,CB,CG,CD,0E1,NE2 




N, CA, C, 


r 0,CB,CGl,CG2 




N,CA,C 


0,CB,CG,CD,0E1,0E2 




N,CA,C, 


0,CB,CG,ODl,OD2 




N,CA,C 


r 0,CB 




N,CA,C 


O , CB , CG , ND 1 , CD2 , CE1 , 


NE2 


N,CA,C 


r 0,CB,CG,0Dl,0D2 




N,CA,C 


, O , CB , CG , CD1 , CD 2 , CE1 , 


CE2,CZ,OH 
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TIB : THR 186:N,CA 

TIB : GLN 188:N,CA 

TIB: THR 192:N,CA 

TIB: LEU 193:N,CA 
5 TIB: TYR 194:N,CA 

TIB : ARG 195:N,CA 

TIB: TYR 213:N,CA 

TIB : SER 217:N,CA 

TIB: PRO 218:N / CA 
10 TIB : GLU 219:N # CA 

TIB : LYS 223:N,CA 

TIB: SER 224:N,CA 

TIB : VAL 230:N,CA 

TIB: THR 231:N,CA 
15 TIB:ASP 234:N,CA 

TIB: ILE 235:N,CA 

TIB: ILE 238:N,CA 

TIB : GLU 239:N,CA 

TIB : GLY 240:N,CA 
20 TIB: GLY 245:N,CA 

TIB: LEU 269:N,CA 
Subset ACT SITE: 

actsiteraole. list 
Subset ACTSITE: 

25 TIB: 17 ,21,80-87 , 89-90 , 113 , 143-153 , 170-176 , 196-206 ,221- 

222,226, 246-249, 
TIB: 251-2 61, 263-267 
actsiteatom. list 
Subset ACTSITE: 
30 TIB: SER 17:N,CA,C,0,CB,OG 

TIB: TYR 21:N,CA,C / 0,CB,CG,CD1,CD2 / CE1,CE2,CZ / 0H 

TIB : PHE 80:N,CA,C,O,CB,CG,CDl,CD2,CEl,CE2,CZ 

TIB: ARG 81: N,CA, C,0, CB, CG, CD,NE,CZ ,NH1 , NH2 

TIB : GLY 82:N,CA,C,0 
35 TIB : SER 83 :N,CA,C,0,CB,OG 

TIB: ARG 84 :N,CA,C,0,CB, CG, CD,NE,CZ,NH1, NH2 

TIB : SER 85:N,CA,C,0,CB,OG 

TIB: ILE 86:N,CA,C,0,CB,CG1,CG2,CD1 

TIB: GLU 87:N,CA,C,0,CB,CG,CD,OEl,OE2 
40 TIB:TRP 89:N,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2 

TIB: ILE 90 :N,CA, C,0, CB, CGI, CG2 , CD1 

TIB : PHE 1 1 3": N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 

TIB : THR 143 :N, CA, C,0, CB, OG1, CG2 

TIB:GLY 144:N,CA,C,0 
45 TIB:HIS 145:N,CA / C,0,CB,CG,ND1,CD2,CE1 / NE2 

TIB : SER 146:N,CA,C,0,CB,OG 

TIB: LEU 147:N,CA,C / 0,CB,CG,CD1,CD2 

TIB:GLY 148:N,CA / C,0 

TIB : GLY 149:N,CA,C,0 
50 TIB : ALA 150:N,CA,C,O,CB 

TIB : LEU 15 1 : N , CA , C , O , CB , CG , CD1 , CD2 

TIB : ALA 152 :N,CA,C,0, CB 

TIB : THR 153:N,CA,C / 0,CB,0G1,CG2 

TIB : SER 170:N,CA,C,O,CB,OG 
55 TIB : TYR 171 :N, CA, C, O, CB, CG, CD1, CD2 , CE1 , CE2 , CZ, OH 

TIB : GLY 172:N,CA,C,0 

TIB: ALA 173 :N, CA, C,0, CB 



C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
CD,C,0, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 
C,0,CB, 

c,o 

C,0 

C,0,CB,0XT,CG,CD1,CD2 



CG,CD / 0E1,NE2 
0G1,CG2 
CG, CD1, CD2 

CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
CG,CD,NE,CZ,NH1,NH2 
CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
OG 

CB,CG 

CG,CD,OEl,OE2 
CG,CD,CE,NZ 
OG 

CG1,CG2 
OG1 , CG2 

CG,ODl,OD2 1 
CG1,CG2,CD1 
CG1,CG2,CD1 
CG,CD,OEl,OE2 
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TIB 


:PRO 


174 :N f CA, CD, 0,0/06,06 






TIB 


: ARG 


175:M,CA,C 


r O,CB, CG, CD, NE,CZ,NH1,NH2 




TIB 


:VAL 


176:N,CA,C 


r 0,CB,CGl,CG2 






TIB 


:ILE 


196:N,CA,C 


r 0,CB,CGl,CG2,CDl 




5 


TIB 


:THR 


197:N,CA,C 


,0,08,001,002 






TIB 


:HIS 


198:N,CA,C 


r 0,CB,CG,NDl,CD2,CEl, 


NE2 




TIB 


:THR 


199:N,CA,C 


,0,CB,0G1,CG2 






TIB 


:ASN 


200:N,CA,C 


,0,CB,CG,0D1,ND2 






TIB 


:ASP 


201:N,CA,C 


r O,CB,CG,ODl,OD2 




10 


TIB 


:ILE 


202:N,CA,C 


r O,CB,CGl,CG2,CDl 






TIB 


:VAL 


203 :N,CA,C 


,0,06,001,002 






TIB 


:PRO 


204:N,CA,CD,C,O,CB,CG 






TIB 


: ARG 


205:N,CA,C 


f O,CB,CG,CD,NE,CZ,NHl 


,NH2 




TIB" 


:LEU 


206:N,CA,C 


0,CB,CG,CD1,CD2 




15 


TIB 


:TRP 










221: 


N, CA,C,0,CB,CG, 


CD1 , CD2 , NE1 , CE2 , CE3 , 


CZ2,CZ3 




TIB 


:ILE 


222:N,CA,C 


r 0,CB,CGl,CG2,CDl 






TIB 


:THR 


226:N,CA,C 


r O,CB,OGl,CG2 






TIB 


:GLY 


246:N,CA,C 


O 




20 


TIB 


:ASN 


247:N,CA,C 


r O,CB,CG,ODl,ND2 






TIB 


:ASN 


248:N,CA,C 


0,CB,CG,0D1,ND2 






TIB 


: GLN 


249 :N,CA,C 


0,CB,CG,CD,0E1,NE2 






TIB 


:ASN 


251:N,CA,C / 


0,CB,CG,0D1,ND2 






TIB 


:ILE 


252:N,CA,C ( 


0,CB,CG1,CG2,CD1 




25 


TIB 


:PRO 


2 5 3 : N , OA , CD ,0,0, CB , CG 






TIB 


:ASP 


254:N,CA,C ( 


0,CB,CG,0D1,0D2 






TIB 


:ILE 


255:N,CA,C 


0,CB,CG1,CG2,CD1 






TIB 


:PRO 


256:N,CA,CD,C,0,CB,CG 






TIB 


: ALA 


257:N,CA,C ( 


0,CB 




30 


TIB 


:HIS 


258:N,CA,C, 


O,CB,CG,NDl,CD2,0El, 


NE2 




TIB: 


:LEU 


259:N,CA,C, 


O,CB,CG,CDl,0D2 






TIB 


:TRP 










260: 


N, CA,C.O,CB, CG, 


CD1 , CD 2 , NE1 , CE2 , CE3 , 


CZ2,CZ3 




TIB' 


:TYR 


261:N,CA,C, 


O,CB,CG,CDl,CD2,0El, 


CE2,CZ, 


35 


TIB 


:GLY 


263 :N,CA,C, 


O 






TIB: 


:LEU 


264 :N,CA,C, 


0,CB,CG,CD1,CD2 






TIB 


:ILE 


265:N,CA,C, 


0,CB,CG1,CG2,CD1 






TIB: 


:GLY 


266:N,CA,C, 


0 






TIB 


:THR 


267:N,CA,C, 


0,CB,0G1,CG2 





40 Subset RESTX: 

restxmole . list 



Subset RESTX r 

NEWMODEL: 14, 16, 18-20, 31-34,3 6,38,40,48-50,56-66,68,78- 
79,88,91-93, 

45 NEWMODEL: 104-106, 120, 136,225,227-229,250,262,268 
restxatom. list 
Subset RESTX: 

NEWMODEL: ALA 14 : N, OA, C, O , CB 

NEWMODEL : TYR 16 : N , OA , C, O , CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ , OH 
50 NEWMODEL : ALA 18 : N, OA, C, O , CB 

NEWMODEL : ALA 19 : N, CA, C, O, CB 

NEWMODEL : ALA 20:N,CA,C,O,CB 

NEWMODEL : GLY 31:N,CA,C,0 

NEWMODEL : THR 32 : N, CA, C, O, CB, 0G1 , CG2 
55 NEWMODEL: ASN 33 : N, CA, C, O , CB, CG, 0D1 , ND2 

NEWMODEL: ILE 34 : N, CA, C, O , CB, CGI , CG2 , CD1 

NEWMODEL: CYS 36 : N, CA, C, O , CB , SG 
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10 



15 



20 



25 



30 



35 



NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 
NEWMODEL 



GLY 
ALA 
ASP 
ALA 
THR 
GLU 
ASP 
SER 
GLY 
VAL 
GLY 
ASP 
VAL 
THR 
GLY 
PHE 
ALA 
LEU 
SER 
ASN 
GLY 
ASN 
LEU 
CYS 
SER 
GLY 
VAL 
PRO 
GLY 
LEU 
VAL 
PRO 
PRO 
PHE 
CYS 



ID 

J o 


■ M P& P 




a n 


■ u p& p 




*t O 


u p& p 


n PR PG. ODl OD2 


A Q 


» m pa p 


a pr 


50 


• N CA C 


O CB OG1 CG2 


56 


• N CA C 


O , CB , CG , CD , 0E1 , OE2 


57 


' N CA C 


O , CB , CG , ODl , OD2 


58 


N CA C 


O . CB . OG 


59. 


: N . CA . C . 


o 


60 


• N CA C 


O , CB , CGI , CG2 




N CA C 


o 


6? 


N CA C 


O CB CG. ODl. OD2 




M Pl\ P 


O PR CGI CG2 




» "W Pl\ P 


D PR OGl CG? 


65 


N , CA , C , 


0 


66 


>N,CA,C, 


O , CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ 


68 


N,CA,C, 


0,CB 


78. 


N,CA,C, 


0,CB,CG,CD1,CD2 


79 


N,CA,C, 


0,CB,OG 


88" 


N,CA,C, 


0,CB,CG,ODl,ND2 


91. 


N,CA,C, 


O 


92! 


N,CA,C, 


0 , CB , CG , ODl , ND2 


93: 


N,CA,C, 


0 f CB , CG , CD1 , CD2 



104:N,CA,C,O / CB,SG 

105:N,CA,C,O,CB,OG 

106:N r CA r C r O 

120:N,CA,C,O,CB,CGl,CG2 

136 :N, CA, CD, C, O, CB, CG 

225:N,CA,C,0 

227:N,CA,C,0,CB,CG,CD1,CD2 
228 IN/CA, 0,0,08,061,002 
229:N,CA,CD,C,0,CB,CG 
250:N, CA, CD, C, O, CB, CG 

2 6 2 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1, CE2 , CZ 
268:N,CA,C,0,CB,SG 



Example 10 

Providing a lipase variant E87K+D254K 

The Humicola lanuginosa lipase variant E87K+D254K was 
4 0 constructed", expressed and purified as described in WO 
92/05249. 



Example 11 

Lipase-S-PEG 15,000 conjugate 
45 The lipase variant E87K+D254K-SPEG conjugate was prepared as 
described in Example 7, except that the enzyme is the Humicola 
lanuginosa lipase variant (E87K+D254K) described in Example 10 
and the polymer is mPEG15,000. 



50 Example 12 
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Immunoaenecity assessed as IgG j of lipase variant (D87K+D254K) in 
Balb/C mice 

Balb/c mice were immunized by subcutanuous injection of: 

i) 50 ul 0.9% (wt/vol) NaCl solution (control group, 8 mice) 
5 (control), 

ii) 50nl 0.9% (wt/vol) NaCl solution containing 25 ng of protein 
of a Humlcola lanuginosa lipase variant (E87K+D254K) (group 1, 

8 mice) (unmodified lipase variant) , 

iii) 50% 0.9% (wt/vol) NaCl solution containing a Humicola 

10 lanugoinosa lipase variant substituted in position D87K+D254K and 

coupled to a N-succinimidyl carbonate activated mPEG 15, 000 (group 

2, 8 mice) ( lipase-SPEG15, 000) . 

The amount of protein for each batch was measured by optical 

density measurements. Blood samples (200 ul) were collected 
15 from the eyes one week after the immunization, but before the 

following immunization. Serum was obtained by blood clothing, 

and centrif ugation. 

The IgGi response was determined by use of the Balb/C mice 

Igd ELISA method as described above. 
20 Results : 

Five weekly immunizations were required to elicit a 
detectable humoral response to the unmodified Humicola 
lanuginosa variant. The antibody titers elicited by the 
conjugate (i.e. lipase-SPEG15, 000 ranged between 960 and 1920, 
25 and were only 2 to 4x lower than the antibody titer of 3840 
that was elicited by unmodified HL82-Lipolase (figure to the 
left) . - 

The results of the tests are shown in Figure 1 

As will be apparent to those skilled in the art, in the light 
3 0 of the foregoing disclosure, many alterations and modifications 
are possible in the practice of this invention without departing 
from the spirit or scope thereof. Accordingly, the scope of the 
invention is to be construed in accordance with the substance 
defined by the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
5 (A) NAME: Novo Nordisk A/S 

<B) STREET: Novo Alle 

(C) CITY: Bagsveard 
(E) COUNTRY: Denmark 

<F) POSTAL CODE (ZIP) : DK-2880 
10 (G) TELEPHONE: +45 4444 8888 

<H) TELEFAX: +45 4449 3256 

(ii) TITLE OF INVENTION: A modified polypeptide 

(iii) NUMBER OF SEQUENCES: 9 
(iv) COMPUTER READABLE FORM: 

15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EP0) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus sp. PD498, NCIMB No. 40484 
(ix) FEATURE: 
30 (A) NAME /KEY: CDS 

(B) LOCATION : 1 . . 840 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TGG TCA CCG AAT GAC CCT TAC TAT TCT GCT TAC CAG TAT GGA CCA CAA 48 
35 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
15 10 15 

AAC ACC TCA ACC CCT GCT GCC TGG GAT GTA ACC CGT GGA AGC AGC ACT 96 
Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
40 20 25 30 

CAA ACG GTG GCG GTC CTT GAT TCC GGA GTG GAT TAT AAC CAC CCT GAT 144 
Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



45 



CTT GCA AGA AAA GTA ATA AAA GGG TAC GAC TTT ATC GAC AGG GAC AAT 192 
Leu Ala Arg Lys Val lie Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
50 55 60 



50 AAC CCA ATG GAT CTT AAC GGA CAT GGT ACC CAT GTT GCC GGT ACT GTT 240 
Asn Pro-Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

GCT GCT GAT ACG AAC AAT GGA ATT GGC GTA GCC GGT ATG GCA CCA GAT 288 
55 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 
85 90 95 

ACG AAG ATC CTT GCC GTA CGG GTC CTT GAT GCC AAT GGA AGT GGC TCA 336 
Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
60 100 105 110 

CTT GAC AGC ATT GCC TCA GGT ATC CGC TAT GCT GCT GAT CAA GGG GCA 384 
Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 

65 

AAG GTA CTC AAC CTC TCC CTT GGT TGC GAA TGC AAC TCC ACA ACT CTT 432 
Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
130 135 140 



70 AAG AGT GCC GTC GAC TAT GCA TGG AAC AAA GGA GCT GTA GTC GTT GCT 



480 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

GCT GCA GGG AAT GAC AAT GTA TCC CGT ACA TTC CAA CCA GCT TCT TAC 528 
5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

CCT AAT GCC ATT GCA GTA GGT GCC ATT GAC TCC AAT GAT CGA AAA GCA 576 
Pro Asn Ala lie Ala Val Gly Ala lie Asp Ser Asn Asp Arg Lys Ala 
10 180 185 190 

TCA TTC TCC AAT TAC GGA ACG TGG GTG GAT GTC ACT GCT CCA GGT GTG 624 
Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 



15 



AAC ATA GCA TCA ACC GTT CCG AAT AAT GGC TAC TCC TAC ATG TCT GGT 672 
Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
210 215 220 



20 ACG TCC ATG GCA TCC CCT CAC GTG GCC GGT TTG GCT GCT TTG TTG GCA 720 
Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

AGT CAA GGT AAG AAT AAC GTA CAA ATC CGC CAG GCC ATT GAG CAA ACC 768 
25 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

GCC GAT AAG ATC TCT GGC ACT GGA ACA AAC TTC AAG TAT GGT AAA ATC 816 
Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
30 260 265 270 

AAC TCA AAC AAA GCT GTA AGA TAC 840 
Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

35 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 280 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

45 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
15 10 15 

Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
20 25 30 

Glir'Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



50 



Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
55 50 55 60 

Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

60 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 
85 90 95 



65 



Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
100 105 110 

Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 



Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
70 130 135 140 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

Pro Asn Ala lie Ala Val Gly Ala lie Asp Ser Asn Asp Arg Lys Ala 
180 185 190 

10 

Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 

Asn lie Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
15 210 215 220 

Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

20 Ser Gin Gly Lys Asn Asn Val Gin lie Arg Gin Ala He Glu Gin Thr 
245 250 255 

Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
260 265 270 

25 

Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

(2) INFORMATION FOR SEQ ID NO: 3: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
35 (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

40 Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

15 10 15 



45 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

55 Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 



60 



Ser Gly Ser Gly Ser Val Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
65 130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Gly Ser He Ser 
145 150 155 160 

70 Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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165 170 175 

Asn Asn Asn Arg Ala Ser Phe S r Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) ~ TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Arthromyces ramosus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Gin Gly Pro Gly Gly Gly Gly Gly Ser Val Thr Cys Pro Gly Gly Gin 
15 10 15 

Ser Thr Ser Asn Ser Gin Cys Cys Val Trp Phe Asp Val Leu Asp Asp 
20 25 30 

Leu Gin Thr Asn Phe Tyr Gin Gly Ser Lys Cys Glu Ser Pro Val Arg 
35 40 45 

Lys lie Leu Arg lie Val Phe His Asp Ala He Gly Phe Ser Pro Ala 
50 55 60 

Leu Thr Ala Ala Gly Gin Phe Gly Gly Gly Gly Ala Asp Gly Ser He 
65 70 75 80 

He Ala His Ser Asn He Glu Leu Ala Phe Pro Ala Asn Gly Gly Leu 
85 90 95 

Thr Asp Thr He Glu Ala Leu Arg Ala Val Gly He Asn His Gly Val 
100 105 110 

Ser Phe Gly Asp Leu He Gin Phe Ala Thr Ala Val Gly Met Ser Asn 
115 120 125 

Cys Pro Gly Ser Pro Arg Leu Glu Phe Leu Thr Gly Arg Ser Asn Ser 
130 135 140 

Ser Gin Pro Ser Pro Pro Ser Leu He Pro Gly Pro Gly Asn Thr Val 
145 150 155 160 

Thr Ala He Leu Asp Arg Met Gly Asp Ala Gly Phe Ser Pro Asp Glu 
165 170 175 

Val Val Asp Leu Leu Ala Ala His Ser Leu Ala Ser Gin Glu Gly Leu 
180 185 190 

Asn Ser Ala He Phe Arg Ser Pro Leu Asp Ser Thr Pro Gin Val Phe 
195 200 205 
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Asp Thr Gin Phe Tyr lie Glu Thr Leu Leu Lys Gly Thr Thr Gin Pro 
210 215 220 

Gly Pro Ser Leu Gly Phe Ala Glu Glu Leu Ser Pro Phe Pro Gly Glu 
5 225 230 235 240 

Phe Arg Met Arg Ser Asp Ala Leu Leu Ala Arg Asp Ser Arg Thr Ala 
245 250 255 

10 Cys Arg Trp Gin Ser Met Thr Ser Ser Asn Glu Val Met Gly Gin Arg 

260 265 270 



15 



Tyr Arg Ala Ala Met Ala Lys Met Ser Val Leu Gly Phe Asp Arg Asn 
275 280 285 

Ala Leu Thr Asp Cys Ser Asp Val lie Pro Ser Ala Val Ser Asn Asn 
290 295 300 



Ala Ala Pro Val lie Pro Gly Gly Leu Thr Val Asp Asp He Glu Val 
20 305 310 315 320 

Ser Cys Pro Ser Glu Pro Phe Pro Glu He Ala Thr Ala Ser Gly Pro 
325 330 335 

25 Leu Pro Ser Leu Ala Pro Ala Pro 

340 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (vi) ORIGINAL SOURCE: 

(B) STRAIN: Humicola lanuginosa DSM 4109 
(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 1. .66 
40 (ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 67. .876 
(ix) FEATURE: 

(A) NAME/KEY: CDS 
45 (B) LOCATION: 1. .876 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AGG AGC TCC CTT GTG CTG TTC TTT GTC TCT GCG TGG ACG GCC TTG 48 
Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser- Ala Trp Thr Ala Leu 
50 -22 -20 -15 -10 

GCC AGT CCT ATT CGT CGA GAG GTC TCG CAG GAT CTG TTT AAC CAG TTC 96 
Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 15 10 

55 

AAT CTC TTT GCA CAG TAT TCT GCA GCC GCA TAC TGC GGA AAA AAC AAT 144 
Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 

60 GAT GCC CCA GCT GGT ACA AAC ATT ACG TGC ACG GGA AAT GCC TGC CCC 192 
Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

GAG GTA GAG AAG GCG GAT GCA ACG TTT CTC TAC TCG TTT GAA GAC TCT 240 
65 Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
45 50 55 

GGA GTG GGC GAT GTC ACC GGC TTC CTT GCT CTC GAC AAC ACG AAC AAA 288 
Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
70 60 65 70 
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TTG ATC GTC CTC TCT TTC CGT GGC TCT CGT TCC ATA GAG AAC TGG ATC 336 
Leu lie Val Leu Ser Phe Arg Gly Ser Arg Ser lie Glu Asn Trp lie 
75 80 85 90 

5 

GGG AAT CTT AAC TTC GAC TTG AAA GAA ATA AAT GAC ATT TGC TCC GGC 384 
Gly Asn Leu Asn Phe Asp Leu Lys Glu lie Asn Asp lie Cys Ser Gly 
95 100 105 

10 TGC AGG GGA CAT GAC GGC TTC ACT TCG TCC TGG AGG TCT GTA GCC GAT 432 
Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

ACG TTA AGG CAG AAG GTG GAG GAT GCT GTG AGG GAG CAT CCC GAC TAT 480 
15 Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
125 130 135 

CGC GTG GTG TTT ACC GGA CAT AGC TTG GGT GGT GCA TTG GCA ACT GTT 528 
Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
20 140 145 150 

GCC GGA GCA GAC CTG CGT GGA AAT GGG TAT GAT ATC GAC GTG TTT TCA 576 
Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 

25 

TAT GGC GCC CCC CGA GTC GGA AAC AGG GCT TTT GCA GAA TTC CTG ACC 624 
Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 

30 GTA CAG ACC GGC GGA ACA CTC TAC CGC ATT ACC CAC ACC AAT GAT ATT 672 
Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

GTC CCT AGA CTC CCG CCG CGC GAA TTC GGT TAC AGC CAT TCT AGC CCA 720 
35 Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
205 210 215 

GAG TAC TGG ATC AAA TCT GGA ACC CTT GTC CCC GTC ACC CGA AAC GAT 768 
Glu Tyr Trp lie Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
40 220 225 230 

ATC GTG AAG ATA GAA GGC ATC GAT GCC ACC GGC GGC AAT AAC CAG CCT 816 
He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 

45 

AAC ATT CCG GAT ATC CCT GCG CAC CTA TGG TAC TTC GGG TTA ATT GGG 864 
Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 

50 ACA TGT CTT TAG 876 
Thr Cys Leu * 
270 

(2) INFORMATION FOR SEQ ID NO: 6: 
55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
-22 -20 -15 -10 

65 Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 15 10 



Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 

70 
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Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
5 45 50 55 

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
60 65 70 

10 Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

15 

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
20 125 130 135 

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
140 145 150 

25 Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 

30 

Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
35 205 210 215 

Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
220 225 230 

40 He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 

Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 

45 

Thr Cys Leu * 
270 

50 (2) INFORMATION FOR SEQ ID NO; 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R28K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

60 gggatgtaac caagggaagc agcactcaaa eg 32 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R62K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
5 cgactttatc gataaggaca ataaccc 

<2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R169K oligo" 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



27 



caatgtatcc aaaacgttcc aaccagc 



27 
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Patent Claims 

1. A polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
5 polypeptide, having been modified in a manner to increase the 

number of attachment groups on the surface of the polypeptide, in 
comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/or 

b) one or more fewer polymeric molecules coupled to the 
10 polypeptide, having been modified in a manner to decrease the 

number of attachment groups at or close to the functional site(s) 
of the polypeptide, in comparison to the number of attachment 
groups available on the corresponding parent polypeptide. 

2. The conjugate according to claims 1, having 1 to 25, 
15 preferably 1 to 10 additional polymeric molecules coupled to the 

surface of the polypeptide in comparison to the number of 
polymeric molecules of a conjugate prepared from the corresponding 
parent enzyme. 

3. The conjugate according to claims 1 and 2, wherein the 
20 additional attachment group (s) is (are) amino groups in the form of 

Lysine residues (s), or carboxylic groups in the form of Aspartic 
acid or Glutamic acid residues. 

4. The conjugate according to any of claims l to 3, wherein 
the additional attachment group (s) is (are) prepared by a 

25 conservative substitution of an amino acid residue, such as an 
Arginine to Lysine substitution. 

5. The conjugate according to claims 1 to 3, wherein the 
additional attachment group (s) is (are) prepared by a conservative 
substitution of an amino acid, such as an Aspargine to 

30 Aspartate/Glutamate or a Glutamine to Aspartate/Glutamate 
substitution. 

6. The conjugate according to any of claims 1 to 5, wherein 
the added attachment group is located more than 5 A, preferably 8 
A, especially 10 A from the functional site. 

35 7. The conjugate according to claim 1, having 1 to 25 

preferably 1 to 10 fewer polymeric molecules coupled at or close 
to the functional site of the polypeptide in comparison to the 
number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 
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8. The conjugate according to claim 7, wherein the removed 
attachment group (s) is (are) amino groups in the form of Lysine 
residues (s) , or carboxylic groups in the form of Aspartic acid or 
Glutamic acid residues. 
5 9. The conjugate according to any of claims 7 and 8, wherein 

the removed attachment group (s) is (are) prepared by a conservative 
substitution of an amino group, such as Lysine to Arginine 
substitution. 

10. The conjugate according to any of claims 7 to 8, wherein 
10 the removed attachment group (s) is (are) prepared by a conservative 

substitution of a carboxylic group, such as an Aspartate/Glutamate 
to Aspargine or Aspartate/Glutamate to a Glutamine substitution. 

11. The conjugate according to any of claims 1 to 10, wherein 
the removed attachment group is located within 5 A, preferably 8 

15 A, especially 10 A from the functional site. 

12. The conjugate according to any of claims 1 to 11, wherein 
the attachment groups are broadly spread. 

13. The conjugates according to claims 1 to 12, wherein the 
parent polypeptide moiety of the conjugate has a molecular weight 

20 from 1 to 100 kDa, preferred 15 to 100 kDa. 

14. The conjugate according to claim 13, wherein the parent 
polypeptide moiety of the conjugate has a molecular weight of from 
l to 35 kDa. 

15. The conjugates according to claim 14, wherein the parent 
25 polypeptide is an enzyme selected from the group of 
Oxidoreductases, including laccases and Superoxide dismutase 
(SOD) ; Hydrolases, including proteases, especially subtilisins, 
and lipolytic enzymes; Transferases, including Transglutaminases 
(TGases) ; Isomerases, including Protein disulfide Isomerases 
30 (PDI) . 

16. The conjugate according to claim 15, wherein the parent 
enzyme is PD498, Savinase®, BPN', Proteinase K, Proteinase R, 
Subtilisin DY, Lion Y, Rennilase®, JA16, Alcalase® or a Humicola 
lanuginosa, lipase, such as Lipolase®. 
35 17. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a PD498 variant with one or more of the 
following substitutions: R51K, R62K, R121K, R169K, R250K, R28K, 
R190K, P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, 
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G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K, S220K, 
Y221K, G262K. 

18. The conjugate according to claim 17 , with one of the 
following mutations: R28K+R62K, R28K+R169K, R62K + R169K, 
5 R28K+R69K+R169K. 

19. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Savinase® variant with one or more of 
the following substitutions: R10K, R19K, R45K, R145K, R170K, 
R186K, R247K, K94R, P5K, P14K, T22K, T38K, H39K, P40K, L42K, 

10 L75K, N76K, L82K, P86K, S103K, V104K, S105K, A108K, A133K, 
T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, G211K, S212K, 
T213K, A215K, S216K, N269K. 

20. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Humicola lanuginosa lipase variant 

15 with one or more of the following substitutions: 

R133K,R139K,R160K,R179K,R209K / R118K,R125K,A18K,G31K,T32K, 
N33K / G38K,A40K,D48K,T50K,E56K / D57K,S58K,G59K,V60K,G61K,D62K, 
T64K,L78K,E87K,N88K,G91K,N92K,L93K,S105K / G106K / V120K,P13 6K,G225 
K,L227K,V228K,P229K,P250K / D254K,F262K. 

20 21. The conjugate according to claim 20 with the following 

mutations E87K+D254K. 

22. The conjugate according to any of claims 1 to 21, wherein 
the polymeric molecules coupled to the polypeptide have a 
molecular weight from 1 to 60 kDa, especially 1-35 kDa, especially 

25 3 to 25 kDa. 

23. The conjugate according to claim 22, wherein the poly- 
meric molecule is selected from the group comprising a natural or 
synthetic homo- and heteropolymers , selected from the group of the 
synthetic polymeric molecules including Branched PEGs, poly-vinyl 

30 alcohol (PVA) , poly-carboxyl acids, poly-(vinylpyrolidone) and 
poly-D,L-amino acids, or natural occurring polymeric molecules 
including dextrans, including carboxymethyl-dextrans , and 
celluloses such as methylcellulose, carboxymethylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, and 

35 hydro ly sates of chitosan, starches, such as hydroxyethyl-starches, 
hydroxypropyl-starches, glycogen, agarose, guar gum, inulin, 
pullulans, xanthan gums, carrageenin, pectin and alginic acid. 

24. A method for preparing improved polypeptide-polymer 
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conjugates comprising the steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
5 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
10 selected in step b) at or close to the functional site, 

d) coupling polymeric molecules to the mutated polypeptide. 

25. The method according to claim 24, wherein the 
identification of amino acid residues located on the surface on 
the polypeptide referred to in step a) are performed by a computer 

15 program analyzing the 3D structure of the parent polypeptide in 
question. 

26. The method according to claim 24, wherein step b) 
comprises selecting Arginine or Lysine residues on the surface of 
the parent polypeptide. 

20 27. The method according to claim 24, wherein one or more 

Arginine residues identified in step b) is (are) substituted with a 
Lysine residue (s) in step c) . 

28. The method according to claims 27, wherein the 
substituted Arginine residues have a distance of more than 5 A, 

25 preferably 8 A , especially 10 A from the functional site. 

29. The method according to any of claims 24 to 28, wherein 
the polypeptide prepared in step . c) is coupled to polymeric 
molecules . 

30. Use of the conjugate in claims 1 to 23 for reducing the 
30 allergenicity of industrial products. 

31. Use of the conjugate in claims 1 to 23 for reducing the 
immunogenicity of pharmaceuticals. 

32. A composition comprising a conjugate of any of claims 1 
to 23 and further comprising ingredients used in industrial 

35 products. 

33. The composition according to claim 32, wherein the 
industrial product is a detergent, such as a laundry, dish wash or 
hard surface cleaning product, or a food or feed product. 
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34. The composition according to claim 32, comprising a 
conjugate of any of claims 1 to 22 and further ingredients used in 
skin care products. 

35. A composition comprising a conjugate of any of claims 1 
to 23 and further comprising ingredients used in pharmaceuticals. 
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